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O (54) Title: METHODS OF DIAGNOSIS OF BLADDER CANCER, COMPOSITIONS AND METHODS OF SCREENING FOR 
^ MODULATORS OF BLADDER CANCER 
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(57) Abstract: Described herein are genes whose expression are up-regulated or down-regulated in bladder cancer. Also described 
^5 are such genes whose expression is further up-regulated or down-regulated in drug-resistant bladder cancer cells. Related methods 

and compositions that can be used for diagnosis, prognosis, or treatment of bladder cancer are disclosed. Also described herein are 
1^ methods that can be used to identify modulators of bladder cancer. 
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METHODS OF DIAGNOSIS OF BLADDER CANCER, COMPOSITIONS AND 
METHODS OF SCREENING FOR MODULATORS OF BLADDER CANCER 

CROSS-REFERENCES TO RELATED APPLICATIONS 
5 This application is related to USSN 60/302,814, filed July 3, 2001; USSN 60/310,099, 

filed August 3, 2001; USSN 60/343,705, filed November 8, 2001; USSN 60/350,666, filed 
November 13, 2001; and USSN 60/372,246, filed April 12, 2001, each of which is 
incorporated herein by reference. 

1 0 FIELD OF THE INVENTION 

The invention relates to the identification of nucleic acid and protein expression 
profiles and nucleic acids, products, and antibodies thereto that are involved in bladder 
cancer; and to the use of such expression profiles and compositions in the diagnosis, 
prognosis, and therapy of bladder cancer. The invention further relates to methods for 
1 5 identifying and using agents and/or targets that inhibit bladder cancer. 

BACKGROUND OF THE INVENTION 
In the United States, over 50,000 new cases of bladder cancer are diagnosed annually, 
and more than 10,000 deaths will be attributed to bladder cancer. Bladder cancer is now the 
20 fourth most common cancer among American men and the ninth most common cancer 

among American women. It occurs three times more frequently in men than in women, and it 
occurs roughly twice more frequently in white versus black men. 

Bladder cancer rarely occurs in people younger than 40 years of age, being primarily 
a disease of older men. Nonetheless, bladder cancer is a significant cause of illness and death 
25 in the United States. The risk of bladder cancer increases steeply with age, with over half of 
all bladder cancer deaths occurring after age 70. In white men older than 65, the annual 
disease rate of bladdercancer is approximately 2 cases per 1,000 persons; this contrasts with 
a rate of 0.1 cases per 1,000 persons younger than 65. 

1 
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Within the United States, bladder cancer rates are higher among people who reside in 
northern versus southern states, and is higher for people who live in urban versus rural areas. 
Although this difference suggests that environmental as well as genetic factors may 
contribute to the development and progression of the disease, other studies confirm that 
5 certain genes play a role in bladder cancer.. For example, expression of the tumor suppressor 
gene p53 has been associated with an adverse prognosis for patients with invasive bladder 
cancer. A retrospective study of 243 patients treated by radical cystectomy found that the 
presence of nuclear p53 was an independent predictor for recurrence among patients with mid 
to late stage tumors. Esrig, et al (1994) N.E.J. Med. 331:1259-64. 

10 Urinary bladder cancers represent a spectrum of diseases that can be grouped into 

three general categories: superficial, invasive, and metastatic. The prognosis for treatment is 
highly dependent on the stage at which the tumor is first diagnosed. A unique aspect of 
bladder cancer treatment is that repeated surgical biopsy is an integral part of routine patient 
management. This has permitted the conduct of molecular genetic studies of tumors from 

15 specific stages of the disease. The results of these studies suggest that bladder cancers 

develop and progress along at least two discrete pathways, which may account for differences 
in invasiveness and metastatic potential. Incorporating molecular genetic factors into the 
current paradigm for diagnosis and treatment will optimize the probability of cure and allow 
the quality of life for bladder cancer patients to be maintained. 

20 Early detection and treatment can prevent reoccurrence and progression of the disease 

to an incurable stage. Thus, the identification of novel diagnostic markers and therapeutic 
. targets will improve the current treatment of bladder cancer patients. While industry and 
academia have identified novel sequences, there has not been an equal effort exerted to 
identify the function of these novel sequences in disease states. The elucidation of a role for 

25 novel proteins and compounds in disease states for identification of diagnostic markers and 
therapeutic targets is essential for improving the current treatment of bladder cancer patients. 
Accordingly, provided herein are methods that can be used in diagnosis and prognosis of 
bladder cancer. Additionally, provided herein are molecular targets for therapeutic 
intervention in bladder cancer and other related bladder diseases.. Further provided are 
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methods that can be used to screen candidate bioactive agents for the ability to modulate 
bladder cancer. 

SUMMARY OF THE INVENTION 
5 The present invention therefore provides nucleotide sequences of genes that are up- 

and down-regulated in bladder cancer cells. Such genes are useful for diagnostic purposes, 
and also as targets for screening for therapeutic compounds that modulate bladder cancer, 
such as hormones or antibodies. Other aspects of the invention will become apparent to the 
skilled artisan by the following description of the invention. 
1 0 In one aspect, the present invention provides a method of detecting a bladder cancer- 

associated transcript in a cell from a patient, the method comprising contacting a biological 
sample from the patient with a polynucleotide that selectively hybridizes to a sequence at 
least 80% identical to a sequence as shown in Tables 1A-13. 

In one embodiment, the present invention provides a method of determining the level 
15 of a bladder cancer associated transcript in a cell from a patient. 

In one embodiment, the present invention provides a method of detecting a bladder 
cancer-associated transcript in a cell from a patient, the method comprising contacting a 
biological sample from the patient with a polynucleotide that selectively hybridizes to a 
sequence at least 80% identical to a sequence as shown in Tables 1 A-13. 
20 In one embodiment, the polynucleotide selectively hybridizes to a sequence at least 

95% identical to a sequence as shown in Tables 1 A-13. 

In one embodiment, the biological sample is a tissue sample. In another embodiment, 
the biological sample comprises isolated nucleic acids, e.g., mRNA. 

In one embodiment, the polynucleotide is labeled, e.g., with a fluorescent label. 
25 In one embodiment, the polynucleotide is immobilized on a solid surface. 

In one embodiment, the patient is undergoing a therapeutic regimen to treat bladder 
cancer. In another embodiment, the patient is suspected of having metastatic bladder cancer. 
In one embodiment, the patient is a human. 

In one embodiment, the bladder cancer associated transcript is mRNA. 
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In one embodiment, the method further comprises the step of amplifying nucleic acids 
before the step of contacting the biological sample with the polynucleotide. 

hi another aspect, the present invention provides a method of monitoring the efficacy 
of a therapeutic treatment of bladder cancer, the method comprising the steps of: (i) providing 
5 a biological sample from a patient undergoing the therapeutic treatment; and (ii) determining 
the level of a bladder cancer-associated transcript in the biological sample by contacting the 
biological sample with a polynucleotide that selectively hybridizes to a sequence at least 80% 
identical to a sequence as shown in Tables 1 A-13, thereby monitoring the efficacy of the 
therapy. In a further embodiment, the patient has metastatic bladder cancer. In a further 
10 embodiment, the patient has a drug resistant form of bladder cancer. 

In one embodiment, the method further comprises the step of: (iii) comparing the 
level of the bladder cancer-associated transcript to a level of the bladder cancer-associated 
transcript in a biological sample from the patient prior to, or earlier in, the therapeutic 
treatment. 

1 5 Additionally, provided herein is a method of evaluating the effect of a candidate 

bladder cancer drug comprising administering the drug to a patient and removing a cell 
sample from the patient. The expression profile of the cell is then determined. This method 
may further comprise comparing the expression profile to an expression profile of a healthy 
individual. In a preferred embodiment, said expression profile includes a gene of Tables 1 A- 

20 13. 

In one aspect, the present invention provides an isolated nucleic acid molecule 
consisting of a polynucleotide sequence as shown in Tables 1 A-13. 

In one embodiment, an expression vector or cell comprises the isolated nucleic acid. 
In one aspect, the present invention provides an isolated polypeptide which is encoded 
25 by a nucleic acid molecule having polynucleotide sequence as shown in Tables 1 A-13 . 

In another aspect, the present invention provides an antibody that specifically binds to 
an isolated polypeptide which is encoded by a nucleic acid molecule having polynucleotide 
sequence as shown in Tables 1A-13. 

In one embodiment, the antibody is conjugated to an effector component, e.g., a 
30 fluorescent label, a radioisotope or a cytotoxic chemical. 

4 
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In one embodiment, the antibody is an antibody fragment. In another embodiment, 
the antibody is humanized. 

In one aspect, the present invention provides a method of detecting a bladder cancer 
cell in a biological sample from a patient, the method comprising contacting the biological 
5 sample with an antibody as described herein. 

In another aspect, the present invention provides a method of detecting antibodies 
specific to bladder cancer in a patient, the method comprising contacting a biological sample 
from the patient with a polypeptide encoded by a nucleic acid comprising a sequence from 
Tables 1A-13. 

10 In another aspect, the present invention provides a method for identifying a compound 

that modulates a bladder cancer-associated polypeptide, the method comprising the steps of: 
(i) contacting the compound with a bladder cancer-associated polypeptide, the polypeptide 
encoded by a polynucleotide that selectively hybridizes to a sequence at least 80% identical 
to a sequence as shown in Tables 1 A-13; and (ii) determining the functional effect of the 

1 5 compound upon the polypeptide. 

In one embodiment, the functional effect is a physical effect, an enzymatic effect, or a 
chemical effect. 

In one embodiment, the polypeptide is expressed in a eukaryotic host cell or cell 
membrane. In another embodiment, the polypeptide is recombinant. 
20 In one embodiment, the functional effect is determined by measuring ligand binding 

to the polypeptide. 

La another aspect, the present invention provides a method of inhibiting proliferation 
of a bladder cancer-associated cell to treat bladder cancer in a patient, the method comprising 
the step of administering to the subject a therapeutically effective amount of a compound 
25 identified as described herein. 

In one embodiment, the compound is an antibody. 

In another aspect, the present invention provides a drug screening assay comprising 
the steps of: (i) administering a test compound to a mammal having bladder cancer or to a 
cell sample isolated therefrom; (ii) comparing the level of gene expression of a 
30 polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence 

5 
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as shown in Tables 1 A-l 3 in a treated cell or mammal with the level of gene expression of the 
polynucleotide in a control cell sample or mammal, wherein a test compound that modulates 
the level of expression of the polynucleotide is a candidate for the treatment of bladder 
cancer. 

5 In one embodiment, the control is a mammal with bladder cancer or a cell sample 

therefrom that has not been treated with the test compound. In another embodiment, the 
control is a normal cell or mammal. 

In one embodiment, the test compound is administered in varying amounts or 
concentrations. In another embodiment, the test compound is administered for varying time 
10 periods. In another embodiment, the comparison can occur after addition or removal of the 
drug candidate. 

In one embodiment, the levels of a plurality of polynucleotides that selectively 
hybridize to a sequence at least 80% identical to a sequence as shown in Tables 1A-13 are 
individually compared to their respective levels in a control cell sample or mammal. In a 
15 preferred embodiment the plurality of polynucleotides is from three to ten. 

In another aspect, the present invention provides a method for treating a mammal 
having bladder cancer comprising administering a compound identified by the assay 
described herein. 

In another aspect, the present invention provides a pharmaceutical composition for 
20 treating a mammal having bladder cancer, the composition comprising a compound identified 
by the assay described herein and a physiologically acceptable excipient. 

In one aspect, the present invention provides a method of screening drug candidates 
by providing a cell expressing a gene that is up- and down-regulated as in a bladder cancer. 
In one embodiment, a gene is selected from Tables 1 A-13. The method further includes 
25 adding a drug candidate to the cell and determining the effect of the drug candidate on the 
expression of the expression profile gene. 

In one embodiment, the method of screening drug candidates includes comparing the 
level of expression in the absence of the drug candidate to the level of expression in the 
presence of the drug candidate, wherein the concentration of the drug candidate can vary 
30 when present, and wherein the comparison can occur after addition or removal of the drug 
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candidate. In a preferred embodiment, the cell expresses at least two expression profile 
genes. The profile genes may show an increase or decrease. 

Also provided is a method of evaluating the effect of a candidate bladder cancer drug 
comprising administering the drug to a transgenic animal expressing or over-expressing the 
5 bladder cancer modulatory protein, or an animal lacking the bladder cancer modulatory 
protein, for example as a result of a gene knockout. 

Moreover, provided herein is a biochip comprising one or more nucleic acid segments 
of Tables 1 A-13, wherein the biochip comprises fewer than 1000 nucleic acid probes. 
Preferably, at least two nucleic acid segments are included. More preferably, at least three 
10 nucleic acid segments are included. 

Furthermore, a method of diagnosing a disorder associated with bladder cancer is 
provided. The method comprises determining the expression of a gene of Tables 1 A- 13 in a 
first tissue type of a first individual, and comparing the distribution to the expression of the 
gene from a second normal tissue type from the first individual or a second unaffected 
1 5 individual. A difference in the expression indicates that the first individual has a disorder 
associated with bladder cancer. 

In a further embodiment, the biochip also includes a polynucleotide sequence of a 
gene that is not up- and down-regulated in bladder cancer. 

In one embodiment a method for screening for a bioactive agent capable of interfering 
20 with the binding of a bladder cancer modulating protein (bladder cancer modulatory protein) 
or a fragment thereof and an antibody which binds to said bladder cancer modulatory protein 
or fragment thereof. In a preferred embodiment, the method comprises combining a bladder 
cancer modulatory protein or fragment thereof a candidate bioactive agent and an antibody 
which binds to said bladder cancer modulatory protein or fragment thereof. The method 
25 further includes determining the binding of said bladder cancer modulatory protein or 
fragment thereof and said antibody. Wherein there is a change in binding, an agent is 
identified as an interfering agent. The interfering agent can be an agonist or an antagonist. 
Preferably, the agent inhibits bladder cancer. 

Also provided herein are methods of eliciting an immune response in an individual. 
30 In one embodiment a method provided herein comprises administering to an individual a 
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composition comprising a bladder cancer modulating protein, or a fragment thereof. In 
another embodiment, the protein is encoded by a nucleic acid selected from those of Tables 
1A-13. 

Further provided herein are compositions capable of eliciting an immune response in 
5 an individual In one embodiment, a composition provided herein comprises a bladder cancer 
modulating protein, preferably encoded by a nucleic acid of Tables 1 A- 13 or a fragment 
thereof, and a pharmaceutically acceptable carrier. In another embodiment, said composition 
comprises a nucleic acid comprising a sequence encoding a bladder cancer modulating 
protein, preferably selected from the nucleic acids of Tables 1 A-13, and a pharmaceutically 

1 0 acceptable carrier. 

Also provided are methods of neutralizing the effect of a bladder cancer protein, or a 
fragment thereof, comprising contacting an agent specific for said protein with said protein in 
an amount sufficient to effect neutralization. In another embodiment, the protein is encoded 
by a nucleic acid selected from those of Tables 1A-13. 

1 5 In another aspect of the invention, a method of treating an individual for bladder 

cancer is provided. In one embodiment, the method comprises administering to said 
individual an inhibitor of a bladder cancer modulating protein. In another embodiment, the 
method comprises administering to a patient having bladder cancer an antibody to a bladder 
cancer modulating protein conjugated to a therapeutic moiety. Such a therapeutic moiety can 

20 be a cytotoxic agent or a radioisotope. 

DETAILED DESCRIPTION OF THE INVENTION 
In accordance with the objects outlined above, the present invention provides novel 
methods for diagnosis and prognosis evaluation for bladder disease (BD), e.g., cancer, 
25 including metastatic bladder cancer, as well as methods for screening for compositions which 
modulate bladder diseases. Also provided are methods and compositions for treating bladder 
disease. Various related conditions where these markers may be useful also, include, e.g., 
carcinoma in situ, various stages of papillary carcinomas; and such conditions in different 
stages, layers, structural portions, etc. 
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Recent advances in molecular medicine, generally, have increased the interest in 
tumor-specific cell surface antigens that could serve as diagnostic or prognostic markers, or 
as targets for various immunotherapeutic or small molecule strategies. Antigens suitable for 
immunotherapeutic strategies should be highly expressed in cancer tissues and ideally not 
5 expressed in other, e.g., normal, adult tissues. Expression in tissues that are dispensable for 
life, however, may be tolerated, as a physiological consequence of such expression would be 
limited. Examples of such antigens in cancers other than bladder cancer include Her2/neu 
and the B-cell antigen CD20. Humanized monclonal antibodies directed to Her2/neu 
(Herceptin®/trastuzumab) are currently in use for the treatment of metastatic breast cancer. 
10 Ross and Fletcher (1998) Stem Cells 16:413-428. Similarly, anti-CD20 monoclonal 
antibodies (Rituxin®/rituximab) are used to effectively treat non-Hodgkin's lymphoma. 
Maloney, et al. (1997) Blood 90:2188-2195; and Leget and Czuczman (1998) Curr. Opin. 
Oncol. 10:548-551. 

15 Definitions 

The term "bladder cancer protein" or "bladder cancer polynucleotide" or "bladder 
cancer-associated transcript" refers to nucleic acid and polypeptide polymorphic variants, 
alleles, mutants, and interspecies homologues that: (1) have a nucleotide sequence that has 
greater than about 60% nucleotide sequence identity, 65%, 70%, 75%, 80%, 85%, 90%, 

20 preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% or greater nucleotide 

sequence identity, preferably over a region of over a region of at least about 25, 50, 100, 200, 
500, 1000, or more nucleotides, to a nucleotide sequence of or associated with a gene of 
Tables 1A-13; (2) bind to antibodies, e.g., polyclonal antibodies, raised against an 
immunogen comprising an amino acid sequence encoded by a nucleotide sequence of or 

25 associated with a gene of Tables 1 A-l 3, and conservatively modified variants thereof; (3) 
specifically hybridize under stringent hybridization conditions to a nucleic acid sequence, or 
the complement thereof of Tables 1A-13 and conservatively modified variants thereof; or (4) 
have an amino acid sequence that has greater than about 60% amino acid sequence identity, 
65%, 70%, 75%, 80%, 85%, 90%, preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 

30 99% or greater amino sequence identity, preferably over a region of over a region of at least 
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about 25, 50, 100, 200, 500, 1000, or more amino acid, to an amino acid sequence encoded 
by a nucleotide sequence of or associated with a gene of Tables 1 A-13. A polynucleotide or 
polypeptide sequence is typically from a mammal including, but not limited to, primate, e.g., 
human; rodent, e.g., rat, mouse, hamster; cow, pig, horse, sheep, or other mammal. A 
5 "bladder cancer polypeptide" and a ,r bladder cancer polynucleotide," include both naturally 
occurring or recombinant forms. 

A "full length" bladder cancer protein or nucleic acid refers to a bladder cancer 
polypeptide or polynucleotide sequence, or a variant thereof, that contains all of the elements 
normally contained in one or more naturally occurring, wild type bladder cancer 

1 0 polynucleotide or polypeptide sequences. The "full length" may be prior to, or after, various 
stages of splicing, including alternative splicing, or post-translation processing. 

"Biological sample" as used herein is a sample of biological tissue or fluid, e.g., that 
contains nucleic acids or polypeptides of a bladder cancer protein, polynucleotide, or 
transcript. Such samples include, but are not limited to, tissue isolated from primates, e.g., 

15 humans, or rodents, e.g., mice and rats. Biological samples may also include sections of 
tissues such as biopsy and autopsy samples, frozen sections taken for histologic purposes, 
blood, plasma, serum, sputum, stool, urine, tears, mucus, hair, skin, etc. Biological samples 
also include explants and primary and/or transformed cell cultures derived from patient 
tissues. A biological sample is typically obtained from a eukaryotic organism, most 

20 preferably a mammal such as a primate, e.g., chimpanzee or human; cow; dog; cat; a rodent, 
e.g., guinea pig, rat, or mouse; rabbit; or a bird; reptile; or fish. 

"Providing a biological sample" means to obtain a biological sample for use in 
methods described in this invention. Most often, this will be done by removing a sample of 
cells from an animal, but can also be accomplished by using previously isolated cells (e.g., 

25 isolated by another person, at another time, and/or for another purpose), or by performing the 
methods of the invention in vivo. Archival tissues, having treatment or outcome history, will 
be particularly useful. 

The terms "identical" or percent "identity," in the context of two or more nucleic acids 
or polypeptide sequences, refer to two or more sequences or subsequences that are the same 

30 or have a specified percentage of amino acid residues or nucleotides that are the same (e.g., 

10 
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about 60% identity, preferably 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 
96%, 97%, 98%, 99%, or higher identity over a specified region, when compared and aligned 
for maximum correspondence over a comparison window or designated region) as measured 
using a BLAST or BLAST 2.0 sequence comparison algorithms with default parameters 
5 described below, or by manual alignment and visual inspection (see, e.g., NCBI web site 
http://www.ncbi.nlm.nih.gov/BLAST/ or the like). Such sequences are then said to be 
"substantially identical." This definition also refers to, or may be applied to, the compliment 
of a test sequence. The definition also includes sequences that have deletions and/or 
additions, substitutions, naturally occurring variants, e.g., polymorphic or allelic, and man- 

1 0 made variants. As described below, the preferred algorithms can account for gaps and the 
like. Preferably, identity exists over a region that is at least about 25 amino acids or 
nucleotides in length, or more preferably over a region that is 50-100 amino acids or 
nucleotides in length. 

For sequence comparison, typically one sequence acts as a reference sequence, to 

15 which test sequences are compared. When using a sequence comparison computer algorithm, 
test and reference sequences, subsequence coordinates, and sequence algorithm program 
parameters are typically designated. Default or alternative program parameters can be 
selected. The sequence comparison algorithm then calculates the percent sequence identities 
for the test sequences relative to the reference sequence, based on the program parameters. 

20 A "comparison window", as used herein, includes reference to a segment of one of the 

number of contiguous positions selected from the group consisting typically of from about 
20-600, usually about 50-200, more usually about 100-150 in which a sequence may be 
compared to a reference sequence of the same number of contiguous positions after the two 
sequences are optimally aligned. Methods of alignment of sequences for comparison are 

25 well-known in the art. Optimal alignment of sequences for comparison can be conducted, by, 
e.g., the local homology algorithm of Smith and Waterman (1981) Adv. Appl. Math. 2:482, 
the homology alignment algorithm of Needleman and Wunsch (1970) J. Mol. Biol. 48:443- 
453, the search for similarity method of Pearson and Lipman (1988) Proc. Natl Acad. Sci. 
USA 85:2444-448, computerized implementations of these algorithms (GAP, BESTFIT, 

30 FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer 
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Group, 575 Science Dr., Madison, WI), or manual alignment and visual inspection (see, e.g., 
Ausubel, et al. (eds. 1995 and supplements) Current Protocols in Molecular Biology 
Lippincott. 

Preferred algorithms suitable for determining percent sequence identity and sequence 
5 similarity include the BLAST and BLAST 2.0 algorithms. See Altschul, et al. (1977) Nuc. 
Acids Res. 25:3389-3402; and Altschul, et al. (1990) J. Mol. Biol 215:403-410. BLAST and 
BLAST 2.0 are used, with the parameters described herein, to determine percent sequence 
identity for the nucleic acids and proteins of the invention. Software for performing BLAST 
analyses is publicly available through the National Center for Biotechnology Information 

1 0 (http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying high scoring 

sequence pairs (HSPs) by identifying short words of length W in the query sequence, which 
either match or satisfy some positive-valued threshold score T when aligned with a word of 
the same length in a database sequence. T is referred to as the neighborhood word score 
threshold (Altschul, et al., supra). These initial neighborhood word hits act as seeds for 

15 initiating searches to find longer HSPs containing them. The word hits are extended in both 
directions along each sequence for as far as the cumulative alignment score can be increased. 
Cumulative scores are calculated using, e.g., for nucleotide sequences, the parameters M 
(reward score for a pair of matching residues; always > 0) and N (penalty score for 
mismatching residues; always < 0). For amino acid sequences, a scoring matrix is used to 

20 calculate the cumulative score. Extension of the word hits in each direction are halted when: 
the cumulative alignment score falls off by the quantity X from its maximum achieved value; 
the cumulative score goes to zero or below, due to the accumulation of one or more negative- 
scoring residue alignments; or the end of either sequence is reached. The BLAST algorithm 
parameters W, T, and X determine the sensitivity and speed of the alignment. The BLASTN 

25 program (for nucleotide sequences) uses as defaults a wordlength (W) of 1 1 , an expectation 
(E) of 10, M=5, N=-4, and a comparison of both strands. For amino acid sequences, the 
BLASTP program uses as defaults a wordlength of 3, expectation (E) of 10, and the 
BLOSUM62 scoring matrix (see Henikoff and Henikoff (1989) Proc. Nat'l Acad. Sci. USA 
89:10915-919) alignments (B) of 50, expectation (E) of 10, M=5, N=-4, and a comparison of 

30 both strands. 

12 
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The BLAST algorithm also performs a statistical analysis of the similarity between 
two sequences (see, e.g., Karlin and Altschul (1993) Proc. Nat'l Acad. Sci. USA 90:5873- 
5787). One measure of similarity provided by the BLAST algorithm is the smallest sum 
probability (P(N)), which provides an indication of the probability by which a match between 
5 two nucleotide or amino acid sequences would occur by chance. For example, a nucleic acid 
is considered similar to a reference sequence if the smallest sum probability in a comparison 
of the test nucleic acid to the reference nucleic acid is less than about 0.2, more preferably 
less than about 0.01 , and most preferably less than about 0.001 . Log values may be large 
negative numbers, e.g., 5, 10, 20, 30, 40, 40, 70, 90, 1 10, 150, 170, etc. 

1 0 An indication that two nucleic acid sequences or polypeptides are substantially 

identical is that the polypeptide encoded by the first nucleic acid is immunologically cross 
reactive with the antibodies raised against the polypeptide encoded by the second nucleic 
acid, as described below. Thus, a polypeptide is typically substantially identical to a second 
polypeptide, e.g., where the two peptides differ only by conservative substitutions. Another 

1 5 indication that two nucleic acid sequences are substantially identical is that the two molecules 
or their complements hybridize to each other under stringent conditions, as described below. 
Yet another indication that two nucleic acid sequences are substantially identical is that the 
same primers can be used to amplify the sequences. 

A "host cell" is a naturally occurring cell or a transformed cell that contains an 

20 expression vector and supports the replication or expression of the expression vector. Host 
cells may be cultured cells, explants, cells in vivo, and the like. Host cells may be 
prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, insect, amphibian, or 
mammalian cells such as CHO, HeLa, and the like (see, e.g., the American Type Culture 
Collection catalog or web site, www.atcc.org). 

25 The terms "isolated," "purified," or "biologically pure" refer to material that is 

substantially or essentially free from components that normally accompany it as found in its 
native state. Purity and homogeneity are typically determined using analytical chemistry 
techniques such as polyacrylamide gel electrophoresis or high performance liquid 
chromatography. A protein or nucleic acid that is the predominant species present in a 

30 preparation is substantially purified. In particular, an isolated nucleic acid is separated from 
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some open reading frames that naturally flank the gene and encode proteins other than protein 
encoded by the gene. The term "purified" in some embodiments typically denotes that a 
nucleic acid or protein gives rise to essentially one band in an electrophoretic gel Preferably, 
it means that the nucleic acid or protein is at least 85% pure, more preferably at least 95% 
5 pure, and most preferably at least 99% pure. "Purify" or "purification" in other embodiments 
means removing at least one contaminant from the composition to be purified. In this sense, 
purification does not require that the purified compound be homogeneous, e.g., 100% pure. 

The terms "polypeptide," "peptide," and "protein" are used interchangeably herein to 
refer to a polymer of amino acid residues. The terms apply to amino acid polymers in which 

10 at least one amino acid residue is an artificial chemical mimetic of a corresponding naturally 
occurring amino acid, as well as to naturally occurring amino acid polymers, those containing 
modified residues, and a non-naturally occurring amino acid polymer. 

The term "amino acid" embraces naturally occurring or synthetic amino acids, amino 
acid analogs, and amino acid mimetics. Naturally occurring amino acids are those encoded 

15 by the genetic code, as well as those amino acids that are later modified, e.g., hydroxyproline, 
y-carboxyglutamate, and O-phosphoserine. Amino acid analogs include compounds that 
share a basic chemical structure with a naturally occurring amino acid, e.g., an a carbon that 
is bound to a hydrogen, a carboxyl group, an amino group, or an R group, e.g., homoserine, 
norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs may have 

20 modified R groups (e.g., norleucine) or modified peptide backbones, but share some basic 
chemical structure with a naturally occurring amino acid. Amino acid mimetics include 
chemical compounds that have a structure that is different from the general chemical 
structure of an amino acid, but that function similarly to a naturally occurring amino acid. 

Amino acids may be referred to herein by their commonly known three letter symbols 

25 or by the one-letter symbols recommended by the IUPAC-IUB Biochemical Nomenclature 
Commission. Nucleotides, likewise, may be referred to by their commonly accepted single- 
letter codes. 

"Conservatively modified variants" applies to amino acid or nucleic acid sequences. 
With respect to particular nucleic acid sequences, conservatively modified variants refers to 
30 those nucleic acids which encode identical or essentially identical amino acid sequences, or 
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where the nucleic acid does not encode an amino acid sequence, to essentially identical or 
associated, e.g., naturally contiguous, sequences. Because of the degeneracy of the genetic 
code, a large number of functionally identical nucleic acids encode most proteins. For 
instance, the codons GCA, GCC, GCG, and GCU all encode the amino acid alanine. Thus, at 
5 each position where an alanine is specified by a codon, the codon can be altered to another of 
the corresponding codons described without altering the encoded polypeptide. Such nucleic 
acid variations are "silent variations," which are one species of conservatively modified 
variations. Each nucleic acid sequence herein which encodes a polypeptide also describes 
silent variations of the nucleic acid. One of skill will recognize that in certain contexts each 

10 codon in a nucleic acid (except AUG, which is ordinarily the only codon for methionine, and 
TGG, which is ordinarily the only codon for tryptophan) can be modified to yield a 
functionally identical molecule. Accordingly, often silent variations of a nucleic acid which 
encodes a polypeptide is implicit in a described sequence with respect to the expression 
product, but not with respect to actual probe sequences. 

15 As to amino acid sequences, one of skill will recognize that individual substitutions, 

deletions, or additions to a nucleic acid, peptide, polypeptide, or protein sequence which 
alters, adds, or deletes a single amino acid or a small percentage of amino acids in the 
encoded sequence is a "conservatively modified variant" where the alteration results in the 
substitution of an amino acid with a chemically similar amino acid. Conservative substitution 

20 tables providing functionally similar amino acids are well known in the art. Such 

conservatively modified variants are in addition to and do not exclude polymorphic variants, 
interspecies homologs, and alleles of the invention. Typically conservative substitutions for 
one another include: 1) Alanine (A), Glycine (G); 2) Aspartic acid (D), Glutamic acid (E); 3) 
Asparagine (N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5) Isoleucine (I), Leucine (L), 

25 Methionine (M), Valine (V); 6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W); 7) Serine 
(S), Threonine (T); and 8) Cysteine (C), Methionine (M). See, e.g., Creighton (1984) 
Proteins: Structure and Molecular Properties Freeman. 

Macromolecular structures such as polypeptide structures can be described in terms of 
various levels of organization. See, e.g., Alberts, et al. (eds. 2001) Molecular Biology of the 

30 Cell (4th ed.) Garland; and Cantor and Schimmel (1 980) Biophysical Chemistry Part I: The 
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Conformation of Biological Macromolecules Freeman. "Primary structure" refers to the 
amino acid sequence of a particular peptide. "Secondary structure" refers to locally ordered, 
three dimensional structures within a polypeptide. These structures are commonly known as 
domains, which are portions of a polypeptide that often form a compact unit of the 
5 polypeptide, and are typically about 25-500 amino acids long. Typical domains are made of 
sections of lesser organization such as stretches of p-sheet and a-helices. "Tertiary structure" 
refers to the complete three dimensional structure of a polypeptide monomer. "Quaternary 
structure" refers to the three dimensional structure formed, usually by the nbncovalent 
association of independent tertiary units. Anisotropic terms are also known as energy terms. 

10 "Nucleic acid" or "oligonucleotide" or "polynucleotide" or grammatical equivalents 

used herein means at least two nucleotides covalently linked together. Oligonucleotides are 
typically from about 5, 6, 7, 8, 9, 10, 12, 15, 25, 30, 40, 50 or more nucleotides in length, up 
to about 100 nucleotides in length. Nucleic acids and polynucleotides are polymers, 
including longer lengths, e.g., 200, 300, 500, 1000, 2000, 3000, 5000, 7000, 10,000, etc. A 

15 nucleic acid of the present invention will generally contain phosphodiester bonds. In some 
cases, nucleic acid analogs are included that may have alternate backbones, e.g., 
phosphoramidate (Beaucage, et al. (1993) Tetrahedron 49:1925-963 and references therein; 
Letsinger (1970) J. Org. Chem. 35:3800-803; Sprinzl, et al. (1977) Eur. J. Biochem. 81:579- 
589; Letsinger, et al. (1986) Nucl. Acids Res. 14:3487-499; Sawai, et al. (1984) Chem. Lett. 

20 805; Letsinger, et al. (1988) J. Am. Chem. Soc. 1 10:4470-471; and Pauwels, et al. (1986) 
Chemica Scripta 26:141-149); phosphorothioate (Mag, et al. (1991), Nucleic Acids Res. 
19:1437-441; and U.S. Patent No. 5,644,048); phosphorodithioate (Brill, et al. (1989) J. Am. 
Chem. Soc. 111:2321-322); O-methylphophoroamidite linkages (see Eckstein (1992) 
Oligonucleotides and Analogues: A Practical Approach Oxford Univ. Press); and peptide 

25 nucleic acid backbones and linkages (see Egholm (1992) J. Am. Chem. Soc. 1 14: 1 895-897; 
Meier, et al. (1992) Chem. Int. Ed. Bnel. 31:1008-010; Nielsen (1993) Nature 365:566-568; 
Carlsson, et al. (1996) Nature 380:207. Other analog nucleic acids include those with 
positively charged backbones(Denpcy, et al. (1995) Proc. Natl Acad. Sci. USA 92:6097- 
101); non-ionic backbones (U.S. Patent Nos. 5,386,023; 5,637,684; 5,602,240; 5,216,141; 

30 and 4,469,863; Kiedrowshi, et al. (1991) Angew. Chem. Intl. Ed. English 30:423-426; 
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Letsinger, et al. (1988) J. Am. Chem. Soc. 110:4470-471; Jung, et al. (1994) Nucleoside and 
Nucleotide 13:1597-xxx; Chapters 2-3 of Sanghvi and Cook (eds. 1994) Carbohydrate 
Modifications in Antisense Research ACS Symposium Series 580; Mesmaeker, et al. (1994) 
Bioorganic and Medicinal Chem. Lett. 4:395-398; Jeffs, et al. (1994) J. Biomolecular NMR 
5 34:17; Horn, et al. (1996) Tetrahedron Lett. 37:743-xxx); and non-ribose backbones (see U.S. 
Patent Nos. 5,235,033 and 5,034,506, and Chapters 6-7 of Sanghvi and Cook (eds. 1994) 
Carbohydrate Modifications in Antisense Research ACS Symposium Series 580. Nucleic 
acids containing one or more carbocyclic sugars are also contemplated. See Jenkins and 
Turner (1995) Chem. Soc. Rev. 24:169-176. Several nucleic acid analogs are described in 

10 Rawls (page 35, June 2, 1997) C&B News . Modifications of the ribose-phosphate backbone 
may be made, e.g., to increase the stability and half-life of such molecules in physiological 
environments or as probes on a biochip. Mixtures of naturally occurring nucleic acids and 
analogs can be made; alternatively, mixtures of different nucleic acid analogs, and mixtures 
of naturally occurring nucleic acids and analogs may be made. 

1 5 Particularly preferred are peptide nucleic acids (PNA) which include peptide nucleic 

acid analogs. These backbones are substantially non-ionic under neutral conditions, in 
contrast to the highly charged phosphodiester backbone of naturally occurring nucleic acids. 
The PNA backbone typically exhibits improved hybridization kinetics, exhibiting larger 
changes in the melting temperature (Tm) for mismatched versus perfectly matched basepairs. 

20 DNA and RNA typically exhibit a 2-4° C drop in T m for an internal mismatch. With the 
non-ionic PNA backbone, the drop is closer to 7-9° C. And due to their non-ionic nature, 
hybridization of the polymers is relatively insensitive to salt concentration. In addition, 
PNAs are not as easily degraded by cellular enzymes, and can be more stable. 

The nucleic acids may be single stranded or double stranded, or contain portions of 

25 both double stranded or single stranded sequence. As will be appreciated by those in the art, 

the depiction of a single strand also defines the sequence of the complementary strand; thus 

the sequences described herein also provide the complement of the sequence. The nucleic 

acid may be DNA, both genomic and cDNA, RNA, or a hybrid, where the nucleic acid may 

contain combinations of deoxyribo- and ribo-nucleotides, and combinations of bases, 

30 including uracil, adenine, thymine, cytosine, guanine, inosine, xanthine, hypoxanthine, 
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isocytosine, isoguanine, etc. "Transcript" typically refers to a naturally occurring RNA, e.g., 
a pre-mRNA, hnRNA, or mRNA. As used herein, the term "nucleoside" includes nucleotides 
and nucleoside and nucleotide analogs, and modified nucleosides such as amino modified 
nucleosides. In addition, "nucleoside" includes non-naturally occurring analog structures. 
5 Thus, e.g., the individual units of a peptide nucleic acid, each containing a base, are referred 
to herein as a nucleoside. 

A "label" or "detectable moiety" is a composition detectable by spectroscopic, 
photochemical, biochemical, immunochemical, chemical, or other physical means. Direct or 
indirect methods are comtemplated. For example, useful labels include 32p s fluorescent 

10 dyes, electron-dense reagents, enzymes (e.g., as commonly used in an ELISA), biotin, 

digoxigenin, or haptens and proteins or other entities which can be made detectable, e.g., by 
incorporating a radiolabel into the peptide or used to detect antibodies specifically reactive 
with the peptide. The labels may be incorporated into the bladder cancer nucleic acids, 
proteins, and antibodies. Methods are well known for conjugating the antibody to the label, 

1 5 including those methods described by Hunter, et al. (1962) Nature 144:945-946; David, et al. 
(1974) Biochemistry 13:1014-021; Pain, et al. (1981) J. Immunol. Meth. 40:219-230; and 
Nygren (1982) J. Histochem. and Cvtochem. 30:407-412. 

An "effector" or "effector moiety" or "effector component" is a molecule that is bound 
(or linked, or conjugated), either covalently, through a linker or a chemical bond, or 

20 noncovalently, through ionic, van der Waals, electrostatic, or hydrogen bonds, to a target, 
e.g., an antibody. The "effector" can be a variety of molecules including, e.g., detection 
moieties including radioactive compounds; fluorescent compounds; an enzyme or substrate; 
tags such as epitope tags; a toxin; activatable moieties; a chemotherapeutic agent; a lipase; an 
antibiotic; a radioisotope emitting "hard", e.g., beta radiation; or an attracting moiety. 

25 A "labeled nucleic acid probe or oligonucleotide" is one that is bound, either 

covalently, through a linker or a chemical bond, or noncovalently, through ionic, van der 
Waals, electrostatic, or hydrogen bonds to a label such that the presence of the probe may be 
detected by detecting the presence of the label bound to the probe. Alternatively, method 
using high affinity interactions may achieve the same results where one of a pair of binding 

30 partners binds to the other, e.g., biotin, streptavidin. 
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As used herein a "nucleic acid probe or oligonucleotide" is defined as a nucleic acid 
capable of binding to a target nucleic acid of complementary sequence through one or more 
types of chemical bonds, usually through complementary base pairing, usually through 
hydrogen bond formation. As used herein, a probe may include natural (e.g., A, G, C, or T) 
5 or modified bases (7-deazaguanosine, inosine, etc.). In addition, the bases in a probe may be 
joined by a linkage other than a phosphodiester bond, so long as it does not functionally 
interfere with hybridization. Thus, e.g., probes may be peptide nucleic acids in which the 
constituent bases are joined by peptide bonds rather than phosphodiester linkages. Probes 
may bind target sequences lacking complete complementarity with the probe sequence 

10 depending upon the stringency of the hybridization conditions. The probes are preferably 
directly labeled as with isotopes, chromophores, lumiphores, chromogens, or indirectly 
labeled such as with biotin to which streptavidin linked label may bind. By assaying for the 
presence or absence of the probe, one can detect the presence or absence of the select 
sequence or subsequence. Diagnosis or prognosis may be based at the genomic level, or at 

15 the level of RNA or protein expression. 

The term "recombinant" when used with reference, e.g., to a cell, or nucleic acid, 
protein, or vector, indicates that the cell, nucleic acid, protein or vector, has been modified by 
the introduction of a heterologous nucleic acid or protein or the alteration of a native nucleic 
acid or protein, or that the cell is derived from a cell so modified. Thus, e.g., recombinant 

20 cells express genes that are not found within the native (non-recombinant) form of the cell or 
express native genes that are otherwise abnormally expressed, under expressed, or not 
expressed at all. By the term "recombinant nucleic acid" herein is meant nucleic acid, 
originally formed in vitro, in general, by the manipulation of nucleic acid, e.g., using 
polymerases and endonucleases, in a form not normally found in nature. In this maimer, 

25 operable linkage of different sequences is achieved. Thus an isolated nucleic acid, in a linear 
form, or an expression vector formed in vitro by ligating DNA molecules that are not 
normally joined, are both considered recombinant for the purposes of this invention. It is 
understood that once a recombinant nucleic acid is made and reintroduced into a host cell or 
organism, it will replicate non-recombinantly, e.g., using in vivo cellular machinery of the 

30 host cell rather than in vitro manipulations; however, such nucleic acids, once produced 
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recombinantly, although subsequently replicated non-recombinantly, are still considered 
recombinant for the purposes of the invention. Similarly, a "recombinant protein" is a protein 
made using recombinant techniques, e.g., through the expression of a recombinant nucleic 
acid as depicted above. 

5 The term "heterologous" when used with reference to portions of a nucleic acid 

indicates that the nucleic acid comprises two or more subsequences that are not normally 
found in the same relationship to each other in nature. For instance, the nucleic acid is 
typically recombinantly produced, having two or more sequences, e.g., from unrelated genes 
arranged to make a new functional nucleic acid, e.g., a promoter from one source and a 
1 0 coding region from another source. Similarly, a heterologous protein will often refer to two 
or more subsequences that are not found in the same relationship to each other in nature (e.g., 
a fusion protein). 

A "promoter" is defined as an array of nucleic acid control sequences that direct 
transcription of a nucleic acid. As used herein, a promoter includes necessary nucleic acid 

15 sequences near the start site of transcription, such as, in the case of a polymerase II type 

promoter, a TATA element. A promoter also optionally includes distal enhancer or repressor 
elements, which can be located as much as several thousand base pairs from the start site of 
transcription. A "constitutive" promoter is a promoter that is active under most 
environmental and developmental conditions. An "inducible" promoter is a promoter that is 

20 active under environmental or developmental regulation. The term "operably linked" refers 
to a functional linkage between a nucleic acid expression control sequence (such as a 
promoter, or array of transcription factor binding sites) and a second nucleic acid sequence, 
wherein the expression control sequence directs transcription of the nucleic acid 
corresponding to the second sequence. 

25 An "expression vector" is a nucleic acid construct, generated recombinantly or 

synthetically, with a series of specified nucleic acid elements that permit transcription of a 
particular nucleic acid in a host cell. The expression vector can be part of a plasmid, virus, or 
nucleic acid fragment. Typically, the expression vector includes a nucleic acid to be 
transcribed operably linked to a promoter. 

20 
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The phrase "selectively (or specifically) hybridizes to" refers to the binding, 
duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under 
stringent hybridization conditions when that sequence is present in a complex mixture (e.g., 
total cellular or library DNA or RNA). 
5 The phrase "stringent hybridization conditions" refers to conditions under which a 

probe will hybridize to its target subsequence, typically in a complex mixture of nucleic 
acids, but to no other sequences. Stringent conditions are sequence-dependent and will be 
different in different circumstances. Longer sequences hybridize specifically at higher 
temperatures. An extensive guide to the hybridization of nucleic acids is found in "Overview 
10 of principles of hybridization and the strategy of nucleic acid assays" in Tijssen (1993) 

Hybridization with Nucleic Probes (Techniques in Biochemistry and Molecular Biology; vol. 
24) Elsevier. Generally, stringent conditions are selected to be about 5-10° C lower than the 
thermal melting point (Tjn) for the specific sequence at a defined ionic strength pH. The T m 

is the temperature (under defined ionic strength, pH, and nucleic acid concentration) at which 
1 5 50% of the probes complementary to the target hybridize to the target sequence at 

equilibrium (as the target sequences are present in excess, at T m , 50% of the probes are 
occupied at equilibrium). Stringent conditions will be those in which the salt concentration is 
less than about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium ion concentration (or 
other salts) at pH 7.0 to 8.3 and the temperature is at least about 30° C for short probes (e.g., 
20 about 10-50 nucleotides) and at least about 60° C for long probes (e.g., greater than about 50 
nucleotides). Stringent conditions may also be achieved with the addition of destabilizing 
agents such as formamide. For selective or specific hybridization, a positive signal is at least 
about two times background, preferably about 10 times background hybridization. 
Exemplary stringent hybridization conditions can be as following: 50% formamide, 5x SSC, 
25 and 1% SDS, incubating at 42° C, or, 5x SSC, 1% SDS, incubating at 65° C, with wash in 
0.2x SSC, and 0.1% SDS at 65° C. For PCR, a temperature of about 36° C is typical for low 
stringency amplification, although annealing temperatures may vary between about 32-48° C 
depending on primer length. For high stringency PCR amplification, a temperature of about 
62° C is typical, although high stringency annealing temperatures can range from about 50- 
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65° C, depending on the primer length and specificity. Typical cycle conditions for both high 
and low stringency amplifications include a denaturation phase of 90-95° C for 30-120 sec, 
an annealing phase lasting 30-120 sec, and an extension phase of about 72° C for 1-2 min. 
Protocols and guidelines for low and high stringency amplification reactions are provided, 
5 e.g., in Innis, et al. (1990) PCR Protocols, A Guide to Methods and Applications Academic 
Press NY. 

Nucleic acids that do not hybridize to each other under stringent conditions are still 
substantially identical if the polypeptides which they encode are substantially identical. This 
occurs, e.g., when a copy of a nucleic acid is created using the maximum codon degeneracy 

10 permitted by the genetic code. In such cases, the nucleic acids typically hybridize under 
moderately stringent hybridization conditions. Exemplary "moderately stringent 
hybridization conditions" include a hybridization in a buffer of 40% formamide, 1 M NaCl, 
1% SDS at 37° C, and a wash in IX SSC at 45° C. A positive hybridization is at least about 
twice background. Alternative hybridization and wash conditions can be utilized to provide 

1 5 conditions of similar stringency. Additional guidelines for determining hybridization 

parameters are provided in numerous references, e.g.,. Ausubel, et al. Current Protocols in 
Molecular Biology Lippincott. 

The phrase "functional effects" in the context of assays for testing compounds that 
modulate activity of a bladder cancer protein includes the determination of a parameter that is 

20 indirectly or directly under the influence of the bladder cancer protein or nucleic acid, e.g., a 
functional, physical, or chemical effect, such as the ability to decrease bladder cancer. It 
includes ligand binding activity; cell growth on soft agar; anchorage dependence; contact 
inhibition and density limitation of growth; cell viability, cellular proliferation; cellular 
transformation; growth factor or serum dependence; tumor specific marker levels; 

25 invasiveness into Matrigel; tumor growth and metastasis in vivo; mRNA and protein 

expression in cells undergoing metastasis, and other characteristics of bladder cancer cells. 
"Functional effects" include in vitro, in vivo, and ex vivo activities. 

By "determining the functional effect" is meant assaying for a compound that 
increases or decreases a parameter that is indirectly or directly under the influence of a 

30 bladder cancer protein sequence, e.g., functional, enzymatic, physical and chemical effects. 
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Such functional effects can be measured by many means known to those skilled in the art, 
e.g., changes in spectroscopic characteristics (e.g., fluorescence, absorbance, refractive 
index), hydrodynamic (e.g., shape), chromatographic, or solubility properties for the protein, 
measuring inducible markers or transcriptional activation of the bladder cancer protein; 
5 measuring binding activity or binding assays, e.g., binding to antibodies or other ligands, and 
measuring cellular proliferation or metabolism. Determination of the functional effect of a 
compound on bladder cancer can also be performed using bladder cancer assays, such as, in 
vitro assays, e.g., cell growth on soft agar; anchorage dependence; contact inhibition and 
density limitation of growth; cellular proliferation; cellular transformation; growth factor or 

10 serum dependence; tumor specific marker levels; invasiveness into Matrigel; tumor growth 
and metastasis in vivo; mRNA and protein expression in cells undergoing metastasis, and 
other characteristics of bladder cancer cells. Functional effects can be evaluated by many 
means, e.g., microscopy for quantitative or qualitative measures of alterations in 
morphological features, measurement of changes in RNA or protein levels for bladder 

1 5 cancer-associated sequences, measurement of RNA stability, identification of downstream or 
reporter gene expression (CAT, luciferase, P-gal, GFP, and the like), e.g., via 
chemiluminescence, fluorescence, colorimetric reactions, antibody binding, inducible 
markers, and ligand binding assays. 

"Inhibitors", "activators", and "modulators" of bladder cancer polynucleotide and 

20 polypeptide sequences are used to refer to activating, inhibitory, or modulating molecules or 
compounds identified using in vitro and in vivo assays of bladder cancer polynucleotide and 
polypeptide sequences. Inhibitors are compounds that, e.g., bind to, partially or totally block 
activity, decrease, prevent, delay activation, inactivate, desensitize, or down regulate the 
activity or expression of bladder cancer proteins, e.g., antagonists. Antisense nucleic acids 

25 may seem to inhibit expression and subsequent function of the protein. "Activators" are 

compounds that increase, open, activate, facilitate, enhance activation, sensitize, agonize, or 
up regulate bladder cancer protein activity. Inhibitors, activators, or modulators also include 
genetically modified versions of bladder cancer proteins, e.g., versions with altered activity, 
as well as naturally occurring and synthetic ligands, antagonists, agonists, antibodies, small 

30 chemical molecules and the like. Such assays for inhibitors and activators include, e.g., 
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expressing the bladder cancer protein in vitro, in cells, or cell membranes, applying putative 
modulator compounds, and then determining the functional effects on activity, as described 
above. Activators and inhibitors of bladder cancer can also be identified by incubating 
bladder cancer cells with the test compound and determining increases or decreases in the 
5 expression of 1 or more bladder cancer proteins, e.g., 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50 or 
more bladder cancer proteins, such as bladder cancer proteins encoded by the sequences set 
out in Tables 1A-13. 

Samples or assays comprising bladder cancer proteins that are treated with a potential 
activator, inhibitor, or modulator are compared to control samples without the inhibitor, 

10 activator, or modulator to examine the extent of inhibition. Control samples (untreated with 
inhibitors) are assigned a relative protein activity value of 100%. Inhibition of a polypeptide 
is achieved when the activity value relative to the control is about 80%, preferably about 
50%, more preferably about 25-0%. Activation of a bladder cancer polypeptide is achieved 
when the activity value relative to the control (untreated with activators) is about 110%, more 

15 preferably about 150%, more preferably about 200-500% (e.g., two to five fold higher 
relative to the control), more preferably about 1000-3000% higher. 

The phrase "changes in cell growth" refers to a change in cell growth and proliferation 
characteristics in vitro or in vivo, such as cell viability, formation of foci, anchorage 
independence, semi-solid or soft agar growth, changes in contact inhibition and density 

20 limitation of growth, loss of growth factor or serum requirements, changes in cell 

morphology, gaining or losing immortalization, gaining or losing tumor specific markers, 
ability to form or suppress tumors when injected into suitable animal hosts, and/or 
immortalization of the cell. See, e.g., pp. 231-241 of Freshney (1994) Culture of Animal 
Cells:A Manual of Basic Technique (3d ed.). 

25 "Tumor cell" refers to precancerous, cancerous, and normal cells in a tumor. 

"Cancer cells," "transformed" cells or "transformation" in tissue culture, refers to 
spontaneous or induced phenotypic changes that do not necessarily involve the uptake of new 
genetic material. Although transformation can arise from infection with a transforming virus 
and incorporation of new genomic DNA, or uptake of exogenous DNA, it can also arise 

30 spontaneously or following exposure to a carcinogen, thereby mutating an endogenous gene. 
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Transformation is associated with phenotypic changes, such as immortalization of cells, 
aberrant growth control, nonmorphological changes, and/or malignancy. See, Freshney 
(2000) Culture of Animal Cells: A Manual of Basic Technique (4th ed.) Wiley-Liss. 
"Antibody" refers to a polypeptide comprising a framework region from an 
5 immunoglobulin gene or fragments thereof that specifically binds and recognizes an antigen. 
The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, 
epsilon, and mu constant region genes, as well as the myriad immunoglobulin variable region 
genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as 
gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, 

10 IgM, IgA, IgD and IgE, respectively. Typically, the antigen-binding region of an antibody or 
its functional equivalent will be most critical in specificity and affinity of binding. See Paul 
(ed. 1999) Fundamental Immunology (4th ed.) Raven. 

An exemplary immunoglobulin (antibody) structural unit comprises a tetramer. Each 
tetramer is composed of two identical pairs of polypeptide chains, each pair having one 

1 5 "light" (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus of each 

chain defines a variable region of about 100 to 1 10 or more amino acids primarily responsible 
for antigen recognition. The terms variable light chain (VjJ and variable heavy chain (Vjj) 

refer to these light and heavy chains respectively. 

Antibodies exist, e.g., as intact immunoglobulins or as a number of well-characterized 

20 fragments produced by digestion with various peptidases. Thus, e.g., pepsin digests an 

antibody below the disulfide linkages in the hinge region to produce F(ab)'2, a dimer of Fab 
which itself is a light chain joined to Vj[-Ch1 by a disulfide bond. The F(ab) ? 2 may be 
reduced under mild conditions to break the disulfide linkage in the hinge region, thereby 
converting the F(ab)*2 dimer into an Fab' monomer. The Fab 1 monomer is essentially Fab 

25 with part of the hinge region. See Paul (ed. 1999) Fundamental Immunology (4th ed.) Raven. 
While various antibody fragments are defined in terms of the digestion of an intact antibody, 
such fragments may be synthesized de novo either chemically or by using recombinant DNA 
methodology. Thus, the term antibody, as used herein, also includes antibody fragments 
either produced by the modification of whole antibodies, or those synthesized de novo using 
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recombinant DNA methodologies (e.g., single chain Fv) or those identified using phage 
display libraries. See, e.g., McCafferty, et al. (1990) Nature 348:552-554. 

For preparation of antibodies, e.g., recombinant, monoclonal, or polyclonal 
antibodies, many techniques can be used. See, e.g., Kohler and Milstein (1975) Nature 
5 256:495-497; Kozbor, et al. (1983) Immunology Today 4:72; Cole, et al. pp. 77-96 in 
- Reisfeld and Sell (1985) Monoclonal Antibodies and Cancer Therapy Liss; Coligan (1991) 
Current Protocols in Immunology Lippincott; Harlow and Lane (1988) Antibodies: A 
Laboratory Manual CSH Press; and Goding (1986) Monoclonal Antibodies: Principles and 
Practice (2d ed.) Academic Press. Techniques for the production of single chain antibodies 

10 (U.S. Patent 4,946,778) can be adapted to produce antibodies to polypeptides of this 

invention. Also, transgenic mice, or other organisms such as other mammals, may be used to 
express humanized antibodies. Alternatively, phage display technology can be used to 
identify antibodies and heteromeric Fab fragments that specifically bind to selected antigens. 
See, e.g., McCafferty, et al. (1990) Nature 348:552-554; and Marks, et al. (1992) 

15 Biotechnology 10:779-783. 

A "chimeric antibody" is an antibody molecule in which (a) the constant region, or a 
portion thereof, is altered, replaced, or exchanged so that the antigen binding site (variable 
region) is linked to a constant region of a different or altered class, effector function, and/or 
species, or an entirely different molecule which confers new properties to the chimeric 

20 antibody, e.g., an enzyme, toxin, hormone, growth factor, drug, etc.; or (b) the variable 

region, or a portion thereof, is altered, replaced, or exchanged with a variable region having a 
different or altered antigen specificity. 

Identification of bladder cancer-associated sequences 

25 In one aspect, the expression levels of genes are determined in different patient 

samples for which diagnosis information is desired, to provide expression profiles. An 
expression profile of a particular sample is essentially a "fingerprint" of the state of the 
sample; while two states may have a particular gene similarly expressed, the evaluation of a 
number of genes simultaneously allows the generation of a gene expression profile that is 

30 characteristic of the state of the cell. That is, normal tissue (e.g., normal bladder or other 
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tissue) may be distinguished from cancerous or metastatic cancerous tissue of the bladder, or 
bladder cancer tissue or metastatic bladder cancerous tissue can be compared with tissue 
samples of bladder and other tissues from surviving cancer patients. By comparing 
expression profiles of tissue in known different bladder cancer states, information regarding 
5 which genes are important (including both up- and down-regulation of genes) in each of these 
states is obtained. 

The identification of sequences that are differentially expressed in bladder cancer 
versus non-bladder cancer tissue allows the use of this information in a number of ways. For 
example, a particular treatment regime may be evaluated: does a chemotherapeutic drug act 

10 to down-regulate bladder cancer, and thus tumor growth or recurrence, in a particular patient; 
or does chemotherapy or radiation therapy induce expression of particular targets. Similarly, 
diagnosis and treatment outcomes may be done or confirmed by comparing patient samples 
with the known expression profiles. Metastatic tissue can also be analyzed to determine the 
stage of bladder cancer in the tissue or origin of a primary tumor. Furthermore, these gene 

1 5 expression profiles (or individual genes) allow screening of drug candidates with an eye to 
mimicking or altering a particular expression profile; e.g., screening can be done for drugs 
that suppress the bladder cancer expression profile. This may be done by making biochips 
comprising sets of important bladder cancer genes, which can then be used in these screens. 
These methods can also be applied on the protein basis; that is, protein expression levels of 

20 the bladder cancer proteins can be evaluated for diagnostic purposes or to screen candidate 
agents. In addition, the bladder cancer nucleic acid sequences can be administered for gene 
therapy purposes, including the administration of antisense or inhibitory nucleic acids, or the 
bladder cancer proteins (including antibodies and other modulators thereof) administered as 
therapeutic drugs. 

25 Thus the present invention provides nucleic acid and protein sequences that are 

differentially expressed in bladder disease or cancer relative to normal tissues and/or non- 
malignant bladder tissue, herein termed "bladder cancer sequences." As outlined below, 
bladder cancer sequences include those that are up-regulated (e.g., expressed at a higher 
level) in bladder cancer, as well as those that are down-regulated (e.g., expressed at a lower 

30 level). In a preferred embodiment, the bladder cancer sequences are from humans; however, 
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as will be appreciated by those in the art, bladder cancer sequences from other organisms may 
be useful in animal models of disease and drug evaluation; thus, other bladder cancer 
sequences are provided, from vertebrates, including mammals, including rodents (rats, mice, 
hamsters, guinea pigs, etc.), primates, farm animals (including sheep, goats, pigs, cows, 
5 horses, etc.) and pets, e.g., (dogs, cats, etc.). Bladder cancer sequences from other organisms 
may be obtained using the techniques outlined below. 

Bladder cancer sequences can include both nucleic acid and amino acid sequences. 
Bladder cancer nucleic acid sequences are useful in a variety of applications, including 
diagnostic applications, which will detect naturally occurring nucleic acids, as well as 

10 screening applications. Biochips comprising nucleic acid probes or PCR microtiter plates 
with selected probes to the bladder cancer sequences can be generated. 

A bladder cancer sequence can be initially identified by substantial nucleic acid 
and/or amino acid sequence homology to the bladder cancer sequences outlined herein. Such 
homology can be based upon the overall nucleic acid or amino acid sequence, and is 

15 generally determined as outlined below, using either homology programs or hybridization 
conditions. 

For identifying bladder cancer-associated sequences, the bladder cancer screen 
typically includes comparing genes identified in different tissues, e.g., normal, non- 
malignant, or cancerous tissues, or tumor tissue samples from patients who have metastatic 

20 disease vs. non metastatic tissue. Other suitable tissue comparisons include comparing 
bladder cancer samples with metastatic cancer samples from other cancers, such as lung, 
bladder, gastrointestinal cancers, ovarian, etc. Samples of different stages of bladder cancer, 
e.g., survivor tissue, drug resistant states, and tissue undergoing metastasis, are applied to 
biochips comprising nucleic acid probes. The samples are first microdissected, if applicable, 

25 and treated for the preparation of mRNA. Suitable biochips are commercially available, e.g., 
from Affymetrix. Gene expression profiles as described herein are generated and the data 
analyzed. 

In one embodiment, genes showing changes in expression as between normal and 
disease states are compared to genes expressed in other normal tissues, preferably normal 
30 bladder, but also including, and not limited to lung, heart, brain, liver, bladder, kidney, 
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muscle, colon, small intestine, large intestine, spleen, bone, and placenta. In a preferred 
embodiment, those genes identified during the bladder cancer screen that are expressed in a 
significant amount in other tissues are removed from the profile, although in some 
embodiments, this is not necessary. That is, when screening for drugs, it is usually preferable 
5 that the target be disease specific, e.g., not be expressed on critical organs. 

In a preferred embodiment, bladder cancer sequences are those that are up-regulated 
in bladder cancer; that is, the expression of these genes is higher in the bladder cancer tissue 
as compared to non-cancerous tissue. "Up-regulation" as used herein often means at least 
about a two-fold change, preferably at least about a three fold change, with at least about 

1 0 five-fold or higher being preferred. Unigene cluster identification numbers and accession 
numbers herein are for the GenBank sequence database and sequences of accession numbers 
are hereby expressly incorporated by reference. GenBank is known in the art, see, e.g., 
Benson, et al. (1998) Nuc. Acids Res. 26:1-7 and http://www.ncbi.nlm.nih.gov/. Sequences 
are also available in other databases, e.g., European Molecular Biology Laboratory (EMBL) 

15 and DNA Database of Japan (DDBJ). 

In another preferred embodiment, bladder cancer sequences are those that are down- 
regulated in the bladder cancer; that is, the expression of these genes is lower in bladder 
cancer tissue as compared to non-cancerous tissue (see, e.g., Tables 1 A-13). "Down- 
regulation" as used herein often means at least about a two-fold change, preferably at least 

20 about a three fold change, with at least about five- fold or higher being preferred. 

Informatics 

The ability to identify genes that are over or under expressed in bladder cancer can 
additionally provide high-resolution, high-sensitivity datasets which can be used in the areas 
25 of diagnostics, therapeutics, drug development, pharmacogenetics, protein structure, 

biosensor development, and other related areas. For example, the expression profiles can be 
used in diagnostic or prognostic evaluation of patients with bladder cancer. Or as another 
example, subcellular toxicological information can be generated to better direct drug structure 
and activity correlation. See Anderson (June 1 1-12, 1998) Pharmaceutical Proteomics: 
30 Targets, Mechanism, and Function, paper presented at the IBC Proteomics conference, 
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Coronado, CA. Subcellular toxicological information can also be utilized in a biological 
sensor device to predict the likely toxicological effect of chemical exposures and likely 
tolerable exposure thresholds (see U.S. Patent No. 5,81 1,231). Similar advantages accrue 
from datasets relevant to other biomolecules and bioactive agents (e.g., nucleic acids, 
5 saccharides, lipids, drugs, and the like). 

Thus, in another embodiment, the present invention provides a database that includes 
at least one set of assay data. The data contained in the database is acquired, e.g., using array 
analysis either singly or in a library format. The database can be in a form in which data can 
be maintained and transmitted, but is preferably an electronic database. The electronic 

1 0 database of the invention can be maintained on an electronic device allowing for the storage 
of and access to the database, such as a personal computer, but is preferably distributed on a 
wide area network, such as the World Wide Web. 

The focus of the present section on databases that include peptide sequence data is for 
clarity of illustration only. It will be apparent that similar databases can be assembled for 

1 5 assay data acquired using an assay of the invention. 

The compositions and methods for identifying and/or quantitating the relative and/or 
absolute abundance of a variety of molecular and macromolecular species from a biological 
sample undergoing bladder cancer, e.g., the identification of bladder cancer-associated 
sequences described herein, provide an abundance of information, which can be correlated 

20 with pathological conditions, predisposition to disease, drug testing, therapeutic monitoring, 
gene-disease causal linkages, identification of correlates of immunity and physiological 
status, among others. Although the data generated from the assays of the invention is suited 
for manual review and analysis, in a preferred embodiment, prior data processing using high- 
speed computers is utilized. 

25 An array of methods for indexing and retrieving biomolecular information is known 

in the art. For example, U.S. Patents 6,023,659 and 5,966,712 disclose a relational database 
system for storing biomolecular sequence information in a manner that allows sequences to 
be catalogued and searched according to one or more protein function hierarchies. U.S. 
Patent 5,953,727 discloses a relational database having sequence records containing 

30 information in a format that allows a collection of partial-length DNA sequences to be 
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catalogued and searched according to association with one or more sequencing projects for 
obtaining full-length sequences from the collection of partial length sequences. U.S. Patent 
5,706,498 discloses a gene database retrieval system for making a retrieval of a gene 
sequence similar to a sequence data item in a gene database based on the degree of similarity 
5 between a key sequence and a target sequence. U.S. Patent 5,538,897 discloses a method 
using mass spectroscopy fragmentation patterns of peptides to identify amino acid sequences 
in computer databases by comparison of predicted mass spectra with experimentally-derived 
mass spectra using a closeness-of-fit measure. U.S. Patent 5,926,81 8 discloses a multi- 
dimensional database comprising a functionality for multi-dimensional data analysis 

10 described as on-line analytical processing (OLAP), which entails the consolidation of 

projected and actual data according to more than one consolidation path or dimension. U.S. 
Patent 5,295,261 reports a hybrid database structure in which the fields of each database 
record are divided into two classes, navigational and informational data, with navigational 
fields stored in a hierarchical topological map which can be viewed as a tree structure or as 

15 the merger of two or more such tree structures. 

See also Mount, et al. (2001) Bioinformatics CSH Press, NY; Durbin, et al. (eds. 
1999) Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids 
Cambridge Univ. Press; Baxevanis and Oeullette (eds. 1998) Bioinformatics: A Practical 
Guide to the Analysis of Genes and Proteins (2d. ed.) Wiley-Liss; Rashidi and Buehler 

20 (1999) Bioinformatics: Basic Applications in Biological Science and Medicine CRC Press; 
Setubal, et al. (eds 1997) Introduction to Computational Molecular Biology Brooks/Cole; 
Misener and Krawetz (eds. 2000) Bioinformatics: Methods and Protocols Oxford Univ. 
Press; Higgins and Taylor (eds. 2000) Bioinformatics: Sequence, Structure, and Databanks: A 
Practical Approach Oxford Univ. Press; Brown (2001) Bioinformatics: A Biologists Guide to 

25 Biocomputing and the Internet Eaton Pub.; Han and Kamber (2000) Data Mining: Concepts 
and Techniques Kaufmann Pub.; and Waterman (1995) Introduction to Computational 
Biology: Maps, Sequences, and Genomes Chap and Hall. 

The present invention provides a computer database comprising a computer and 
software for storing in computer-retrievable form assay data records cross-tabulated, e.g., 
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with data specifying the source of the target-containing sample from which each sequence 
specificity record was obtained. 

In an exemplary embodiment, at least one of the sources of target-containing sample 
is from a control tissue sample known to be free of pathological disorders. In a variation, at 
5 least one of the sources is a known pathological tissue specimen, e.g., a neoplastic lesion or 
another tissue specimen to be analyzed for bladder cancer. In another variation, the assay 
records cross-tabulate one or more of the following parameters for each target species in a 
sample: (1) a unique identification code, which can include, e.g., a target molecular structure 
and/or characteristic separation coordinate (e.g., electrophoretic coordinates); (2) sample 

1 0 source; and (3) absolute and/or relative quantity of the target species present in the sample. 

The invention also provides for the storage and retrieval of a collection of target data 
in a computer data storage apparatus, which can include magnetic disks, optical disks, 
magneto-optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM, magnetic 
bubble memory devices, and other data storage devices, including CPU registers and on-CPU 

1 5 data storage arrays. Typically, the target data records are stored as a bit pattern in an array of 
magnetic domains on a magnetizable medium or as an array of charge states or transistor gate 
states, such as an array of cells in a DRAM device (e.g., each cell comprised of a transistor 
and a charge storage area, which may be on the transistor). In one embodiment, the invention 
provides such storage devices, and computer systems built therewith, comprising a bit pattern 

20 encoding a protein expression fingerprint record comprising unique identifiers for at least 10 
target data records cross-tabulated with target source. 

When the target is a peptide or nucleic acid, the invention preferably provides a 
method for identifying related peptide or nucleic acid sequences, comprising performing a 
computerized comparison between a peptide or nucleic acid sequence assay record stored in 

25 or retrieved from a computer storage device or database and at least one other sequence. The 
comparison can include a sequence analysis or comparison algorithm or computer program 
embodiment thereof (e.g., FAST A, TFASTA, GAP, BESTFIT) and/or the comparison may 
be of the relative amount of a peptide or nucleic acid sequence in a pool of sequences 
determined from a polypeptide or nucleic acid sample of a specimen. 
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The invention also preferably provides a magnetic disk, such as an IBM-compatible 
(DOS, Windows, Windows95/98/2000, Windows NT, OS/2) or other format (e.g., Linux, 
SunOS, Solaris, AIX, SCO Unix, VMS, MV, Macintosh, etc.) floppy diskette or hard (fixed, 
Winchester) disk drive, comprising a bit pattern encoding data from an assay of the invention 
5 in a file format suitable for retrieval and processing in a computerized sequence analysis, 
comparison, or relative quantitation method. 

The invention also provides a network, comprising a plurality of computing devices 
linked via a data link, such as an Ethernet cable (coax or lOBaseT), telephone line, ISDN 
line, wireless network, optical fiber, or other suitable signal transmission medium, whereby at 

10 least one network device (e.g., computer, disk array, etc.) comprises a pattern of magnetic 
domains (e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM cells) 
composing a bit pattern encoding data acquired from an assay of the invention. 

The invention also provides a method for transmitting assay data that includes 
generating an electronic signal on an electronic communications device, such as a modem, 

15 ISDN terminal adapter, DSL, cable modem, ATM switch, or the like, wherein the signal 
includes (in native or encrypted format) a bit pattern encoding data from an assay or a 
database comprising a plurality of assay results obtained by the method of the invention. 

In a preferred embodiment, the invention provides a computer system for comparing a 
query target to a database containing an array of data structures, such as an assay result 

20 obtained by the method of the invention, and ranking database targets based on the degree of 
identity and gap weight to the target data. A central processor is preferably initialized to load 
and execute the computer program for alignment and/or comparison of the assay results. 
Data for a query target is entered into the central processor via an I/O device. Execution of 
the computer program results in the central processor retrieving the assay data from the data 

25 file, which comprises a binary description of an assay result. 

The target data or record and the computer program can be transferred to secondary 
memory, which is typically random access memory (e.g., DRAM, SRAM, SGRAM, or 
SDRAM). Targets are ranked according to the degree of correspondence between a selected 
assay characteristic (e.g., binding to a selected affinity moiety) and the same characteristic of 

30 the query target and results are output via an I/O device. For example, a central processor 
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can be a conventional computer (e.g., Intel Pentium, PowerPC, Alpha, PA-8000, SPARC, 
MIPS 4400, MIPS 10000, VAX, etc.); a program can be a commercial or public domain 
molecular biology software package (e.g., UWGCG Sequence Analysis Software, Darwin); a 
data file can be an optical or magnetic disk, a data server, a memory device (e.g., DRAM, 
5 SRAM, SGRAM, SDRAM, EPROM, bubble memory, flash memory, etc.); an I/O device can 
be a terminal comprising a video display and a keyboard, a modem, an ISDN terminal 
adapter, an Ethernet port, a punched card reader, a magnetic strip reader, or other suitable I/O 
device. 

The invention also preferably provides the use of a computer system, such as that 
10 described above, which comprises: (1) a computer; (2) a stored bit pattern encoding a 

collection of peptide sequence specificity records obtained by the methods of the invention, 
which may be stored in the computer; (3) a comparison target, such as a query target; and (4) 
a program for alignment and comparison, typically with rank-ordering of comparison results 
on the basis of computed similarity values. 

15 

Characteristics of bladder cancer-associated proteins 

Bladder cancer proteins of the present invention may be classified as secreted 
proteins, transmembrane proteins or intracellular proteins. In one embodiment, the bladder 
cancer protein is an intracellular protein. Intracellular proteins may be found in the 

20 cytoplasm and/or in the nucleus. Intracellular proteins are involved in all aspects of cellular 
function and replication (including, e.g., signaling pathways); aberrant expression of such 
proteins often results in unregulated or disregulated cellular processes (see, e.g., Alberts, et 
al. (1994) Molecular Biology of the Cell (3d ed.) Garland. For example, many intracellular 
proteins have enzymatic activity such as protein kinase activity, protein phosphatase activity, 

25 protease activity, nucleotide cyclase activity, polymerase activity and the like. Intracellular 
proteins also serve as docking proteins that are involved in organizing complexes of proteins, 
or targeting proteins to various subcellular localizations, and are involved in maintaining the 
structural integrity of organelles. 

An increasingly appreciated concept in characterizing proteins is the presence in the 

30 proteins of one or more structural motifs for which defined functions have been attributed. In 
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addition to the highly conserved sequences found in the enzymatic domain of proteins, highly 
conserved sequences have been identified in proteins that are involved in protein-protein 
interaction. For example, Src-homology-2 (SH2) domains bind tyrosine-phosphorylated 
targets in a sequence dependent manner. PTB domains, which are distinct from SH2 
5 domains, also bind tyrosine phosphorylated targets. SH3 domains bind to proline-rich 

targets. In addition, PH domains, tetratricopeptide repeats and WD domains to name only a 
few, have been shown to mediate protein-protein interactions. Some of these may also be 
involved in binding to phospholipids or other second messengers. As will be appreciated by 
one of ordinary skill in the art, these motifs can be identified on the basis of amino acid 

10 sequence; thus, an analysis of the sequence of proteins may provide insight into both the 
enzymatic potential of the molecule and/or molecules with which the protein may associate. 
One useful database is Pfam (protein families), which is a large collection of multiple 
sequence alignments and hidden Markov models covering many common protein domains. 
Versions are available via the internet from Washington University in St. Louis, the Sanger 

15 Center in England, and the Karolinska Institute in Sweden. See, e.g., Bateman, et al. (2000) 
Nuc. Acids Res . 28:263-266; Sonnhammer, et al. (1997) Proteins 28:405-420; Bateman, et al. 
(1999) Nuc. Acids Res. 27:260-262; and Sonnhammer, et al. (1998) Nuc. Acids Res. 26:320- 
322. 

In another embodiment, the bladder cancer sequences are transmembrane proteins. 

20 Transmembrane proteins are molecules that span a phospholipid bilayer of a cell. They may 
have an intracellular domain, an extracellular domain, or both. The intracellular domains of 
such proteins may have a number of functions including those already described for 
intracellular proteins. For example, the intracellular domain may have enzymatic activity 
and/or may serve as a binding site for additional proteins. Frequently the intracellular 

25 domain of transmembrane proteins serves both roles. For example certain receptor tyrosine 
kinases have both protein kinase activity and SH2 domains. In addition, autophosphorylation 
of tyrosines on the receptor molecule itself, creates binding sites for additional SH2 domain 
containing proteins. 

Transmembrane proteins may contain from one to many transmembrane domains. 
30 For example, receptor tyrosine kinases, certain cytokine receptors, receptor guanylyl cyclases 
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and receptor serine/threonine protein kinases contain a single transmembrane domain. 
However, various other proteins including channels and adenylyl cyclases contain numerous 
transmembrane domains. Many important cell surface receptors such as G protein coupled 
receptors (GPCRs) are classified as "seven transmembrane domain" proteins, as they contain 
5 7 membrane spanning regions. Characteristics of transmembrane domains include 

approximately 17 consecutive hydrophobic amino acids that may be followed by charged 
amino acids. Therefore, upon analysis of the amino acid sequence of a particular protein, the 
localization and number of transmembrane domains within the protein may be predicted (see, 
e.g., PSORT web site http://psort.nibb.ac.jp/). Important transmembrane protein receptors 

10 include, but are not limited to the insulin receptor, insulin-like growth factor receptor, human 
growth hormone receptor, glucose transporters, transferrin receptor, epidermal growth factor 
receptor, low density lipoprotein receptor, epidermal growth factor receptor, leptin receptor, 
and interleukin receptors, e.g., IL-1 receptor, EL-2 receptor, etc. 

The extracellular domains of transmembrane proteins are diverse; however, conserved 

1 5 motifs are found repeatedly among various extracellular domains. Conserved structure 
and/or functions have been ascribed to different extracellular motifs. Many extracellular 
domains are involved in binding to other molecules. In one aspect, extracellular domains are 
found on receptors. Factors that bind the receptor domain include circulating ligands, which 
may be peptides, proteins, or small molecules such as adenosine and the like. For example, 

20 growth factors such as EGF, FGF, and PDGF are circulating growth factors that bind to their 
cognate receptors to initiate a variety of cellular responses. Other factors include cytokines, 
mitogenic factors, neurotrophic factors and the like. Extracellular domains also bind to cell- 
associated molecules. In this respect, they mediate cell-cell interactions. Cell-associated 
ligands can be tethered to the cell, e.g., via a glycosylphosphatidylinositol (GPI) anchor, or 

25 may themselves be transmembrane proteins. Extracellular domains also associate with the 
extracellular matrix and contribute to the maintenance of the cell structure. 

Bladder cancer proteins that are transmembrane are particularly preferred in the 
present invention as they are readily accessible targets for immunotherapeutics, as are 
described herein. In addition, as outlined below, transmembrane proteins can be also useful 

30 in imaging modalities. Antibodies may be used to label such readily accessible proteins in 
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situ. Alternatively, antibodies can also label intracellular proteins, in which case samples are 
typically permeablized to provide access to intracellular proteins. 

It will also be appreciated by those in the art that a transmembrane protein can be 
made soluble by removing transmembrane sequences, e.g., through recombinant methods. 
5 Furthermore, transmembrane proteins that have been made soluble can be made to be 
secreted through recombinant means by adding an appropriate signal sequence. 

In another embodiment, the bladder cancer proteins are secreted proteins; the 
secretion of which can be either constitutive or regulated. These proteins may have a signal 
peptide or signal sequence that targets the molecule to the secretory pathway. Secreted 

1 0 proteins are involved in numerous physiological events; e.g., if circulating, they often serve 
to transmit signals to various other cell types. The secreted protein may function in an 
autocrine manner (acting on the cell that secreted the factor), a paracrine manner (acting on 
cells in close proximity to the cell that secreted the factor), an endocrine manner (acting on 
cells at a distance, e.g, secretion into the blood stream), or exocrine (secretion, e.g., through a 

15 duct or to adjacent epithelial surface as sweat glands, sebaceous glands, pancreatic ducts, 
lacrimal glands, mammary glands, wax producing glands of the ear, etc.). Thus secreted 
molecules often find use in modulating or altering numerous aspects of physiology. Bladder 
cancer proteins that are secreted or released proteins are particularly preferred in the present 
invention as they serve as good targets for diagnostic markers, e.g., for blood, plasma, serum, 

20 or urine tests. Those which are enzymes may be antibody or small molecule targets. Others 
may be useful as vaccine targets, e.g., via CTL mechanisms. 

Use of bladder cancer nucleic acids 

As described above, bladder cancer sequence is initially identified by substantial 
25 nucleic acid and/or amino acid sequence homology or linkage to the bladder cancer 

sequences outlined herein. Such homology can be based upon the overall nucleic acid or 
amino acid sequence, and is generally determined as outlined below, using either homology 
programs or hybridization conditions. Typically, linked sequences on a mRNA are found on 
the same molecule. 
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The bladder cancer nucleic acid sequences of the invention, e.g., the sequences in 
Tables 1 A-13, can be fragments of larger genes, e.g., they are nucleic acid segments. 
"Genes" in this context includes coding regions, non-coding regions, and mixtures of coding 
and non-coding regions. Accordingly, as will be appreciated by those in the art, using the 
5 sequences provided herein, extended sequences, in either direction, of the bladder cancer 
genes can be obtained, using techniques well known in the art for cloning either longer 
sequences or the full length sequences; see Ausubel, et al., supra. Much can be done by 
informatics and many sequences can be clustered to include multiple sequences 
corresponding to a single gene, e.g., systems such as UniGene (see, 

10 http://www.ncbi.nlm.nih.gov/UniGene/). 

Once a bladder cancer nucleic acid is identified, it can be cloned and, if necessary, its 
constituent parts recombined to form the entire bladder cancer nucleic acid coding regions or 
the entire mRNA sequence. Once isolated from its natural source, e.g., contained within a 
plasmid or other vector or excised therefrom as a linear nucleic acid segment, the 

1 5 recombinant bladder cancer nucleic acid can be further-used as a probe to identify and isolate 
other bladder cancer nucleic acids, e.g., extended coding regions. It can also be used as a 
"precursor" nucleic acid to make modified or variant bladder cancer nucleic acids and 
proteins. 

The bladder cancer nucleic acids of the present invention are used in several ways. In 
20 a first embodiment, nucleic acid probes to the bladder cancer nucleic acids are made and 

attached to biochips to be used in screening and diagnostic methods, as outlined below, or for 
administration, e.g., for gene therapy, vaccine, and/or antisense/inhibition applications. 
Alternatively, the bladder cancer nucleic acids that include coding regions of bladder cancer 
proteins can be put into expression vectors for the expression of bladder cancer proteins, 
25 again for screening purposes or for administration to a patient. 

In a preferred embodiment, nucleic acid probes to bladder cancer nucleic acids (both 
the nucleic acid sequences outlined in the tables and/or the complements thereof) are made. 
The nucleic acid probes attached to the biochip are designed to be substantially 
complementary to the bladder cancer nucleic acids, e.g., the target sequence (either the target 
30 sequence of the sample or to other probe sequences, e.g., in sandwich assays), such that 
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hybridization of the target sequence and the probes of the present invention occurs. As 
outlined below, this complementarity need not be perfect; there may be a number of base pair 
mismatches which will interfere with hybridization between the target sequence and the 
single stranded nucleic acids of the present invention. However, if the number of mutations 
5 is so great that no hybridization can occur under even the least stringent of hybridization 
conditions, the sequence is not a complementary target sequence. Thus, by "substantially 
complementary" herein is meant that the probes are sufficiently complementary to the target 
sequences to hybridize under normal reaction conditions, particularly high stringency 
conditions, as outlined herein. 

10 A nucleic acid probe is generally single stranded but can be partially single and 

partially double stranded. The strandedness of the probe is dictated by the structure, 
composition, and properties of the target sequence. In general, the nucleic acid probes range 
from about 8 to about 100 bases long, with from about 10 to about 80 bases being preferred, 
and from about 30 to about 50 bases being particularly preferred. That is, generally whole 

15 genes are not used. In some embodiments, much longer nucleic acids can be used, up to 
hundreds of bases. 

In a preferred embodiment, more than one probe per sequence is used, with either 
overlapping probes or probes to different sections of the target being used. That is, two, 
three, four or more probes, with three being preferred, are used to build in a redundancy for a 

20 particular target. The probes can be overlapping (e.g., have some sequence in common), or 
separate. In some cases, PCR primers may be used to amplify signal for higher sensitivity. 

As will be appreciated by those in the art, nucleic acids can be attached or 
immobilized to a solid support in a wide variety of ways. By "immobilized" and grammatical 
equivalents herein is meant the association or binding between the nucleic acid probe and the 

25 solid support is sufficient to be stable under the conditions of binding, washing, analysis, and 
removal as outlined below. The binding can typically be covalent or non-covalent. By "non- 
covalent binding" and grammatical equivalents herein is meant one or more of electrostatic, 
hydrophilic, and hydrophobic interactions. Included in non-covalent binding is the covalent 
attachment of a molecule, such as, streptavidin to the support and the non-covalent binding of 

30 the biotinylated probe to the streptavidin. By "covalent binding" and grammatical 
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equivalents herein is meant that the two moieties, the solid support and the probe, are 
attached by at least one bond, including sigma bonds, pi bonds and coordination bonds. 
Covalent bonds can be formed directly between the probe and the solid support or can be 
formed by a cross linker or by inclusion of a specific reactive group on either the solid 
5 support or the probe or both molecules. Immobilization may also involve a combination of 
covalent and non-covalent interactions. 

In general, the probes are attached to the biochip in a wide variety of ways, as will be 
appreciated by those in the art. As described herein, the nucleic acids can either be 
synthesized first, with subsequent attachment to the biochip, or can be directly synthesized on 
10 the biochip. 

The biochip comprises a suitable solid substrate. By "substrate" or "solid support" or 
other grammatical equivalents herein is meant a material that can be modified to contain 
discrete individual sites appropriate for the attachment or association of the nucleic acid 
probes and is amenable to at least one detection method. As will be appreciated by those in 

15 the art, the number of possible substrates are very large, and include, but are not limited to, 
glass and modified or functionalized glass, plastics (including acrylics, polystyrene and 
copolymers of styrene and other materials, polypropylene, polyethylene, polybutylene, 
polyurethanes, TeflonJ, etc.), polysaccharides, nylon or nitrocellulose, resins, silica or silica- 
based materials including silicon and modified silicon, carbon, metals, inorganic glasses, 

20 plastics, etc. In general, the substrates allow optical detection and do not appreciably 
fluoresce. See WO 00/55627. 

Generally the substrate is planar, although as will be appreciated by those in the art, 
other configurations of substrates may be used as well. For example, the probes may be 
placed on the inside surface of a tube, for flow-through sample analysis to minimize sample 

25 volume. Similarly, the substrate may be flexible, such as a flexible foam, including closed 
cell foams made of particular plastics. 

In a preferred embodiment, the surface of the biochip and the probe may be 
derivatized with chemical functional groups for subsequent attachment of the two. Thus, e.g., 
the biochip is derivatized with a chemical functional group including, but not limited to, 

30 amino groups, carboxy groups, oxo groups and thiol groups, with amino groups being 
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particularly prefeired. Using these functional groups, the probes can be attached using 
functional groups on the probes. For example, nucleic acids containing amino groups can be 
attached to surfaces comprising amino groups, e.g., using linkers as are known in the art; e.g., 
homo-or hetero-bifiinctional linkers as are well known (see 1994 Pierce Chemical Company 
5 catalog, technical section on cross-linkers, pages 155-200). In addition, in some cases, 

additional linkers, such as alkyl groups (including substituted and heteroalkyl groups) may be 
used. 

In this embodiment, oligonucleotides are synthesized as is known in the art, and then 
attached to the surface of the solid support. As will be appreciated by those skilled in the art, 
10 either the 5 ! or 3' terminus may be attached to the solid support, or attachment may be via an 
internal nucleoside. 

In another embodiment, the immobilization to the solid support may be very strong, 
yet non-covalent. For example, biotinylated oligonucleotides can be made, which bind to 
surfaces covalently coated with streptavidin, resulting in attachment. 

15 Alternatively, the oligonucleotides may be synthesized on the surface, as is known in 

the art. For example, photoactivation techniques utilizing photopolymerization compounds 
and techniques are used. In a preferred embodiment, the nucleic acids can be synthesized in 
situ, using well known photolithographic techniques, such as those described in WO 
95/251 16; WO 95/35505; U.S. Patent Nos. 5,700,637 and 5,445,934; and references cited 

20 within, all of which are expressly incorporated by reference; these methods of attachment 
form the basis of the Affimetrix GeneChip™ technology. 

Often, amplification-based assays are performed to measure the expression level of 
bladder cancer-associated sequences. These assays are typically performed in conjunction 
with reverse transcription. In such assays, a bladder cancer-associated nucleic acid sequence 

25 acts as a template in an amplification reaction (e.g., Polymerase Chain Reaction, or PCR). In 
a quantitative amplification, the amount of amplification product will be proportional to the 
amount of template in the original sample. Comparison to appropriate controls provides a 
measure of the amount of bladder cancer-associated RNA. Methods of quantitative 
amplification are well known to those of skill in the art. Detailed protocols for quantitative 

41 



WO 03/003906 



PCT/US02/21338 



PCR are provided, e.g., in Innis, et al. (1990) PCR Protocols: A Guide to Methods and 
Applications Academic Press. 

In some embodiments, a TaqMan based assay is used to measure expression. 
TaqMan based assays use a fluorogenic oligonucleotide probe that contains a 5' fluorescent 
5 dye and a 3 ! quenching agent. The probe hybridizes to a PCR product, but cannot itself be 
extended due to a blocking agent at the 3' end. When the PCR product is amplified in 
subsequent cycles, the 5' nuclease activity of the polymerase, e.g., AmpliTaq, results in the 
cleavage of the TaqMan probe. This cleavage separates the 5* fluorescent dye and the 3' 
quenching agent, thereby resulting in an increase in fluorescence as a function of 

10 amplification. See, e.g., literature provided by Perkin-Elmer, e.g., www2.perkin-elmer.com. 
Other suitable amplification methods include, but are not limited to, ligase chain 
reaction (LCR) (see Wu and Wallace (1989) Genomics 4:560-569; Landegren, et al. (1988) 
Science 241:1077-1080; and Barringer, et al (1990) Gene 89:117-122), transcription 
amplification (Kwoh, et al. (1989) Proc. Natl Acad. Sci. USA 86:1173-1177), self-sustained 

15 sequence replication (Guatelli, et al. (1990) Proc. Natl Acad. Sci. USA 87:1874-1878), dot 
PCR, and linker adapter PCR, etc. 

Expression of bladder cancer proteins from nucleic acids 

In a preferred embodiment, bladder cancer nucleic acids, e.g., encoding bladder 
20 cancer proteins, are used to make a variety of expression vectors to express bladder cancer 
proteins which can then be used in screening assays, as described below. Expression vectors 
and recombinant DNA technology are well known to those of skill in the art (see, e.g., 
Ausubel, supra, and Fernandez and Hoeffler (eds. 1999) Gene Expression Systems Academic 
Press) and are used to express proteins. The expression vectors may be either self-replicating 
25 extrachromosomal vectors or vectors which integrate into a host genome. Generally, these 
expression vectors include transcriptional and translation^ regulatory nucleic acid operably 
linked to the nucleic acid encoding the bladder c v ancer protein. The term "control sequences" 
refers to DNA sequences used for the expression of an operably linked coding sequence in a 
particular host organism. Control sequences that are suitable for prokaryotes, e.g., include a 
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promoter, optionally an operator sequence, and a ribosome binding site. Eukaryotic cells are 
known to utilize promoters, polyadenylation signals, and enhancers. 

Nucleic acid is "operably linked" when it is placed into a functional relationship with 
another nucleic acid sequence. For example, DNA for a presequence or secretory leader is 
5 operably linked to DNA for a polypeptide if it is expressed as a preprotein that participates in 
the secretion of the polypeptide; a promoter or enhancer is operably linked to a coding 
sequence if it affects the transcription of the sequence; a ribosome binding site is operably 
linked to a coding sequence if it is positioned so as to facilitate translation; two sequences 
may be operably linked if they are physically linked on a single polynucleotide. Generally, 

1 0 "operably linked" means that the DNA sequences being linked are contiguous, and, in the 
case of a secretory leader, contiguous and in reading phase. However, enhancers do not have 
to be contiguous. Linking is typically accomplished by ligation at convenient restriction 
sites. If such sites do not exist, synthetic oligonucleotide adaptors or linkers are used in 
accordance with conventional practice. Transcriptional and translational regulatory nucleic 

15 acid will generally be appropriate to the host cell used to express the bladder cancer protein. 
Numerous types of appropriate expression vectors, and suitable regulatory sequences are 
known in the art for a variety of host cells. 

In general, transcriptional and translational regulatory sequences may include, but are 
not limited to, promoter sequences, ribosomal binding sites, transcriptional start and stop 

20 sequences, translational start and stop sequences, and enhancer or activator sequences. In a 
preferred embodiment, the regulatory sequences include a promoter and transcriptional start 
and stop sequences. 

Promoter sequences encode either constitutive or inducible promoters. The promoters 
may be either naturally occurring promoters or hybrid promoters. Hybrid promoters, which 
25 combine elements of more than one promoter, are useful in the present invention. 

In addition, an expression vector may comprise additional elements. The expression 
vector may have two replication systems, thus allowing it to be maintained in two organisms, 
e.g., in mammalian or insect cells for expression and in a procaryotic host for cloning and 
replication. For integrating expression vectors, the expression vector may contain at least one 
30 sequence homologous to the host cell genome, and preferably two homologous sequences 
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which flank the expression construct. The integrating vector may be directed to a specific 
locus in the host cell by selecting the appropriate homologous sequence for inclusion in the 
vector. Constructs for integrating vectors are well known (e.g., Fernandez and Hoeffler, 
supra). 

5 In addition, in a preferred embodiment, the expression vector contains a selectable 

marker gene to allow the selection of transformed host cells. Selection genes are well known 
in the art and will vary with the host cell used. 

The bladder cancer proteins of the present invention may be produced by culturing a 
host cell transformed with an expression vector under the appropriate conditions to induce or 

10 cause expression of the bladder cancer protein. Conditions appropriate for bladder cancer 
protein expression will vary with the choice of the expression vector and the host cell, and 
will be easily ascertained by one skilled in the art through routine experimentation or 
optimization. For example, the use of constitutive promoters in the expression vector will 
typically require optimizing the growth and proliferation of the host cell, while the use of an 

1 5 inducible promoter typically requires identifying the appropriate growth conditions for 
induction. In addition, in some embodiments, the timing of the harvest is important. For 
example, the baculoviral systems used in insect cell expression are lytic viruses, and harvest 
time selection can be crucial for product yield. 

Appropriate host cells include yeast, bacteria, archaebacteria, fungi, insect, and 

20 animal cells, including mammalian cells. Of particular interest are Saccharomyces cerevisiae 
and other yeasts, E. coli, Bacillus subtilis, Sf9 cells, C129 cells, 293 cells, Neurospora, BHK, 
CHO, COS, HeLa cells, HUVEC (human umbilical vein endothelial cells), THP1 cells (a 
macrophage cell line) and various other human cells and cell lines. 

In a preferred embodiment, the bladder cancer proteins are expressed in mammalian 

25 cells. Mammalian expression systems include retroviral and adenoviral systems. Retroviral 
vector systems are described in PCT/US97/01019 and PCT/US97/01048. Of particular use 
are promoters from mammalian viral genes, since viral genes are often highly expressed and 
have a broad host range. Examples include the S V40 early promoter, mouse mammary tumor 
virus LTR promoter, adenovirus major late promoter, herpes simplex virus promoter, and the 

30 CMV promoter (see, e.g., Fernandez and Hoeffler, supra). Typically, transcription 
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termination and polyadenylation sequences recognized by mammalian cells are regulatory 
regions located 3' to the translation stop codon and thus, together with the promoter elements, 
flank the coding sequence. Examples of transcription terminator and polyadenlyation signals 
include those derived from SV40. 
5 Methods of introducing exogenous nucleic acid into mammalian and other hosts are 

well known, and will vary with the host cell used. Techniques include dextran-mediated 
transfection, calcium phosphate precipitation, polybrene mediated transfection, protoplast 
fusion, electroporation, viral infection, encapsulation of the polynucleotide(s) in liposomes, 
and direct microinjection of the DNA into nuclei. 

1 0 In another embodiment, bladder cancer proteins are expressed in bacterial systems. 

Promoters from bacteriophage may also be used. Synthetic promoters and hybrid promoters 
are also useful; e.g., the tac promoter is a hybrid of the trp and lac promoter sequences. A 
bacterial promoter can include naturally occurring promoters of non-bacterial origin that have 
the ability to bind bacterial RNA polymerase and initiate transcription. Often an efficient 

1 5 ribosome binding site is desirable. The expression vector may include a signal peptide 
sequence that provides for secretion of the bladder cancer protein. The protein is either 
secreted into the growth media (gram-positive bacteria) or into the periplasmic space, located 
between the inner and outer membrane of the cell (gram-negative bacteria). The bacterial 
expression vector may include a selectable marker gene to allow for the selection of bacterial 

20 strains that have been transformed. Suitable selection genes include genes which render the 
bacteria resistant to drugs, e.g., ampicillin, chloramphenicol, erythromycin, kanamycin, 
neomycin, and tetracycline, or biosynthetic genes, e.g., those in the histidine, tryptophan, and 
leucine biosynthetic pathways. These components are assembled into expression vectors. 
Expression vectors for bacteria include vectors for Bacillus subtilis, E. coli, Streptococcus 

25 cremoris, and Streptococcus lividans, among others (e.g., Fernandez and Hoeflfler, supra). 
The bacterial expression vectors are transformed into bacterial host cells using, e.g., calcium 
chloride treatment, electroporation, and other methods. 

Bladder cancer proteins can also be produced in insect cells. See, e.g., Miller, et al. 
(1997) Baculovirus Expression Vectors: A Laboratory Manual Oxford Books; ISBN: 

30 0716770172; and Makrides (1999) Prot. Expr. Purif. 17:183-202. 
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Bladder cancer protein may be produced in yeast cells. Yeast expression systems 
exist with expression vectors for Saccharomyces cerevisiae, Candida albicans and C. maltosa, 
Hansenula polymorphs Kluyveromyces fragilis and K. lactis, Pichia guillerimondii and P. 
pastoris, Schizosaccharomyces pombe, and Yarrowia lipolytica. See, e.g., Jones, et al. (eds. 
5 1 993) The Molecular and Cellular Biology of the Yeast Saccharomyces: Gene Expression 
CSH Press; ISBN: 0879693657. 

The bladder cancer protein may also be made as a fusion protein, using techniques 
well known in the art. Thus, e.g., for the creation of monoclonal antibodies, if the desired 
epitope is small, the bladder cancer protein may be fused to a carrier protein to form an 

1 0 immunogen. Alternatively, the bladder cancer protein may be made as a fusion protein to 
increase expression, or for other reasons. For example, when the bladder cancer protein is a 
bladder cancer peptide, the nucleic acid encoding the peptide may be linked to other nucleic 
acid for expression or purification purposes. 

The bladder cancer protein is typically purified or isolated after expression. Bladder 

1 5 cancer proteins may be isolated or purified in a variety of ways, depending on what other 

components are present in the sample. Standard purification methods include electrophoretic, 
molecular, immunological, and chromatographic techniques, including ion exchange, 
hydrophobic, affinity, reverse-phase HPLC chromatography, and chromatofocusing. The 
bladder cancer protein may be purified using a standard anti-bladder cancer protein antibody 

20 affinity column. Ultrafiltration and diafiltration techniques, in conjunction with protein 

concentration, are also useful. For general guidance in suitable purification techniques, see, 
e.g., Scopes (1982) Protein Purification Springer- Verlag. The degree of purification 
necessary will vary depending on the use of the bladder cancer protein. In some instances no 
purification will be necessary, which may depend on the intended use. 

25 Once expressed and purified, if necessary, the bladder cancer proteins and nucleic 

acids are useful in a number of applications. They may be used as iromunoselection reagents, 
as vaccine reagents, as screening agents, etc. 

Variants of bladder cancer proteins 
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In one embodiment, the bladder cancer proteins are derivative or variant bladder 
cancer proteins as compared to the wild-type sequence. That is, as outlined more fully below, 
the derivative bladder cancer peptide will often contain at least one amino acid substitution, 
deletion, or insertion, with amino acid substitutions being particularly preferred. The amino 
5 acid substitution, insertion, or deletion may occur at most residues within the bladder cancer 
peptide. 

Certain embodiments of bladder cancer proteins of the present invention are amino 
acid sequence variants. These variants typically fall into one or more of three classes: 
substitutional, insertional, or deletional variants. These variants ordinarily are prepared by 

10 site specific mutagenesis of nucleotides in the DNA encoding the bladder cancer protein, 
using cassette or PCR, mutagenesis, or other techniques well known in the art, to produce 
DNA encoding the variant, and thereafter expressing the DNA in recombinant cell culture as 
outlined above. However, variant bladder cancer protein fragments having up to about 100- 
150 residues may be prepared by in vitro synthesis using established techniques. Amino acid 

1 5 sequence variants are often characterized by the predetermined nature of the variation, a 
feature that sets them apart from naturally occurring allelic or interspecies variation of the 
bladder cancer protein amino acid sequence. The variants typically exhibit the same 
qualitative biological activity as the naturally occurring analogue, although variants can also 
be selected which have modified characteristics as will be more fully outlined below. 

20 While the site or region for introducing an amino acid sequence variation is often 

predetermined, the mutation per se need not be predetermined. To optimize the performance 
of a mutation at a given site, random mutagenesis may be conducted at the target codon or 
region and the expressed bladder cancer variants screened for the optimal combination of 
desired activities. Techniques for making substitution mutations at predetermined sites in 

25 DNA having a known sequence are well known, e.g., Ml 3 primer mutagenesis and PCR 
mutagenesis. Screening of the mutants is performed using assays of bladder cancer protein 
activities. 

Amino acid substitutions are typically of single residues; insertions usually will be on 
the order of from about 1 to 20 amino acids, although considerably larger insertions may be 
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tolerated. Deletions range from about 1-20 residues, although in some cases deletions may be 
much larger. 

Substitutions, deletions, insertions, or combinations thereof may be used to arrive at a 
final derivative. Generally these changes are done on a few amino acids to minimize the 
5 alteration of the molecule. However, larger changes may be tolerated in certain 

circumstances. When small alterations in the characteristics of the bladder cancer protein are 
desired, substitutions are generally made in accordance with the amino acid substitution 
relationships provided in the definition section. 

The variants typically exhibit the same qualitative biological activity and elicit the 

10 same immune response as the naturally-occurring analog, although variants also are selected 
to modify the characteristics of the bladder cancer proteins as needed. Alternatively, the 
variant may be designed such that the biological activity of the bladder cancer protein is 
altered. For example, glycosylation sites may be altered or removed. 

Substantial changes in function or immunological identity are made by selecting 

15 substitutions that are less conservative than those described above. Substitutions may be 

made which more significantly affect: the structure of the polypeptide backbone in the area of 
the alteration, e.g., the alpha-helical or beta-sheet structure; the charge or hydrophobicity of 
the molecule at the target site; or the bulk of the side chain. Substitutions which are expected 
to produce the greatest changes in the polypeptide's properties are those in which (a) a 

20 hydrophilic residue, e.g., serine or threonine is substituted for (or by) a hydrophobic residue, 
e.g., leucine, isoleucine, phenylalanine, valine, or alanine; (b) a cysteine or proline is 
substituted for (or by) another residue; (c) a residue having an electropositive side chain, e.g., 
lysine, arginine, or histidine, is substituted for (or by) an electronegative residue, e.g., 
glutamic acid or aspartic acid; or (d) a residue having a bulky side chain, e.g., phenylalanine, 

25 is substituted for (or by) one not having a side chain, e.g., glycine. 

Covalent modifications of bladder cancer polypeptides are included within the scope 
of this invention. One type of covalent modification includes reacting targeted amino acid 
residues of a bladder cancer polypeptide with an organic derivatizing agent that is capable of 
reacting with selected side chains or the N-or C-terminal residues of a bladder cancer 

30 polypeptide. Derivatization with Afunctional agents is useful, e.g., for crosslinking bladder 
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cancer polypeptides to a water-insoluble support matrix or surface for use in the method for 
purifying anti-bladder cancer polypeptide antibodies or screening assays. Commonly used 
crosslinking agents include, e.g., l,l-bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N- 
hydroxysuccinimide esters, e.g., esters with 4-azidosalicylic acid, homobifunctional 
5 imidoesters, including disuccinimidyl esters such as 3,3'-ditWobis(succinimidylpropionate), 
Afunctional maleimides such as bis-N-maleimido-l,8-octane and agents such as methyl-3- 
((p-azidophenyl)dithio)propioimidate. 

Other modifications include deamidation of glutaminyl and asparaginyl residues to 
the corresponding glutamic and aspartic residues, respectively, hydroxylation of proline and 

1 0 lysine, phosphorylation of hydroxyl groups of serine, threonine or tyrosine residues, 

methylation of the amino groups of the lysine, arginine, and histidine side chains (pp. 79-86, 
Creighton (1984) Proteins: Structure and Molecular Properties Freeman), acetylation of the 
N-terminal amine, and amidation of a C-terminal carboxyl group. 

Another type of covalent modification of the bladder cancer polypeptide included 

15 within the scope of this invention comprises altering the native glycosylation pattern of the 
polypeptide. "Altering the native glycosylation pattern" is intended for purposes herein to 
mean deleting one or more carbohydrate moieties found in native sequence bladder cancer 
polypeptide, and/or adding one or more glycosylation sites that are not present in the native 
sequence bladder cancer polypeptide. Glycosylation patterns can be altered in many ways. 

20 For example the use of different cell types to express bladder cancer-associated sequences 
can result in different glycosylation patterns. 

Addition of glycosylation sites to bladder cancer polypeptides may also be 
accomplished by-altering the amino acid sequence thereof. The alteration may be made, e.g., 
by the addition of, or substitution by, one or more serine or threonine residues to the native 

25 sequence bladder cancer polypeptide (for O-linked glycosylation sites). The bladder cancer 
amino acid sequence may optionally be altered through changes at the DNA level, 
particularly by mutating the DNA encoding the bladder cancer polypeptide at preselected 
bases such that codons are generated that will translate into the desired amino acids. 

Another means of increasing the number of carbohydrate moieties on the bladder 

30 cancer polypeptide is by chemical or enzymatic coupling of glycosides to the polypeptide. 
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Such methods are described in the art, e.g., in WO 87/05330, and pp. 259-306 in Aplin and 
Wriston (1981) CRC Crit. Rev. Biochem. 

Removal of carbohydrate moieties present on the bladder cancer polypeptide may be 
accomplished chemically or enzymatically or by mutational substitution of codons encoding 
5 amino acid residues that serve as targets for glycosylation. Chemical deglycosylation 
techniques are known in the art. See, e.g., Hakimuddin, et al. (1987) Arch. Biochem. 
Biophvs. 259:52-57; and Edge, et al. (1981) Anal. Biochem. 118:131-137. Enzymatic 
cleavage of carbohydrate moieties on polypeptides can be achieved by the use of a variety of 
endo-and exo-glycosidases. See, e.g., Thotakura, et al. (1987) Meth. Enzvmol. 138:350-359. 

10 Another type of covalent modification of bladder cancer comprises linking the 

bladder cancer polypeptide to one of a variety of nonproteinaceous polymers, e.g., 
polyethylene glycol, polypropylene glycol, or polyoxyalkylenes, in the manner set forth in 
U.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192; or 4,179,337. 

Bladder cancer polypeptides of the present invention may also be modified to form 

1 5 chimeric molecules comprising a bladder cancer polypeptide fused to a heterologous 

polypeptide or amino acid sequence. In one embodiment, a chimeric molecule comprises a 
fusion of a bladder cancer polypeptide with an epitope tag. The epitope tag is generally 
placed at the amino-or carboxyl-terminus of the bladder cancer polypeptide. The presence of 
such epitope-tagged forms of a bladder cancer polypeptide can be detected using an antibody 

20 against the tag polypeptide. Also, provision of the epitope tag enables the bladder cancer 
polypeptide to be readily purified by affinity purification using an anti-tag antibody or 
another type of affinity matrix that binds to the epitope tag. In an alternative embodiment, 
the chimeric molecule may comprise a fusion of a bladder cancer polypeptide with an 
immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of the 

25 chimeric molecule, such a fusion could be to the Fc region of an IgG molecule. 

Various tag polypeptides and their respective antibodies are well known in the art. 
Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; HIS6 
and metal chelation tags, the flu HA tag polypeptide and its antibody 12CA5 (Field, et al. 
(1988) Mol. Cell. Biol. 8:2159-2165); the c-myc tag and the 8F9, 3C7, 6E10, G4, B7, and 

30 9E10 antibodies thereto (Evan, et al. (1985) Molecular and Cellular Biology 5:3610-3616); 
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and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody (Paborsky, et al. 
(1990) Protein Engineering 3:547-553). Other tag polypeptides include the Flag-peptide 
(Hopp, et al. (1988) BioTechnology 6:1204-1210); the KT3 epitope peptide (Martin, et al 
(1992) Science 255:192-194); tubulin epitope peptide (Skinner, et al. (1991) J. Biol. Chem. 
5 266:15163-15166); and the T7 gene 10 protein peptide tag (Lutz-Freyermuth, et al. (1990) 
Proc. Nat'l Acad. Sci. USA 87:6393-6397). 

Also included are other bladder cancer proteins of the bladder cancer family, and 
bladder cancer proteins from other organisms, which are cloned and expressed as outlined 
below. Thus, probe or degenerate polymerase chain reaction (PCR) primer sequences may be 

1 0 used to find other related bladder cancer proteins from humans or other organisms. As will 
be appreciated by those in the art, particularly useful probe and/or PCR primer sequences 
include the unique areas of the bladder cancer nucleic acid sequence. Preferred PCR primers 
are from about 15-35 nucleotides in length, with from about 20-30 being preferred, and may 
contain inosine as needed. The conditions for the PCR reaction are well known. See, e.g., 

1 5 Innis ( 1 990) PCR Protocols^ supra. 

Antibodies to bladder cancer proteins 

In a preferred embodiment, when the bladder cancer protein is to be used to generate 
antibodies, e.g., for immunotherapy or immunodiagnosis, the bladder cancer protein should 

20 share at least one epitope br determinant with the full length protein. By "epitope" or 

"determinant" herein is typically meant a portion of a protein which will generate and/or bind 
an antibody or T-cell receptor in the context of MHC. Thus, in most instances, antibodies 
made to a smaller bladder cancer protein will be able to bind to the full-length protein, 
particularly linear epitopes. In a preferred embodiment, the epitope is unique; that is, 

25 antibodies generated to a unique epitope show little or no cross-reactivity. 

Methods of preparing polyclonal antibodies are known (see, e.g., Coligan, supra; and 
Harlow and Lane, supra). Polyclonal antibodies can be raised in a mammal, e.g., by one or 
more injections of an immunizing agent and, if desired, an adjuvant. Typically, the 
immunizing agent and/or adjuvant will be injected in the mammal by multiple subcutaneous 

30 or intraperitoneal injections. The immunizing agent may include a protein encoded by a 
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nucleic acid of the tables or fragment thereof or a fusion protein thereof. It may be useful to 
conjugate the immunizing agent to a protein known to be immunogenic in the mammal being 
immunized. Examples of such immunogenic proteins include but are not limited to keyhole 
limpet hemocyanin, serum albumin, bovine thyroglobulin, and soybean trypsin inhibitor. 
5 Examples of adjuvants which may be employed include Freund's complete adjuvant and 
MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose dicorynomycolate). The 
immunization protocol may be selected as appropriate. 

The antibodies may be monoclonal antibodies. Monoclonal antibodies may be 
prepared using hybridoma methods, such as those described by Kohler and Milstein (1975) 

1 0 Nature 256:495-497. In a hybridoma method, a mouse, hamster, or other appropriate host 
animal, is typically immunized with an immunizing agent to elicit lymphocytes that produce 
or are capable of producing antibodies that will specifically bind to the immunizing agent. 
Alternatively, the lymphocytes may be immunized in vitro. The immunizing agent will 
typically include a polypeptide encoded by a nucleic acid of Tables 1 A- 13 or fragment 

1 5 thereof, or a fusion protein thereof. Generally, either peripheral blood lymphocytes ("PBLs") 
are used if cells of human origin are desired, or spleen cells or lymph node cells are used if 
non-human mammalian sources are desired. The lymphocytes are then fused with an 
immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to form a 
hybridoma cell (pp. 59-103, Goding (1986) Monoclonal Antibodies: Principles and Practice 

20 Academic Press). Immortalized cell lines are usually transformed mammalian cells, 
particularly myeloma cells of rodent, bovine and human origin. Usually, rat or mouse 
myeloma cell lines are employed. The hybridoma cells may be cultured in a suitable culture 
medium that preferably contains one or more substances that inhibit the growth or survival of 
the unfiised, immortalized cells. For example, if the parental cells lack the enzyme 

25 hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium 
for the hybridomas typically will include hypoxanthine, aminopterin, and thymidine ("HAT 
medium"), which substances prevent the growth of HGPRT-deficient cells. 

In one embodiment, the antibodies are bispecific antibodies. Bispecific antibodies are 
typically monoclonal, preferably human or humanized, antibodies that have binding 

30 specificities for at least two different antigens or that have binding specificities for two 
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epitopes on the same antigen. In one embodiment, one of the binding specificities is for a 
protein encoded by a nucleic acid Tables 1A-13 or a fragment thereof, the other one is for 
another antigen, and preferably for a cell-surface protein or receptor or receptor subunit, 
preferably one that is tumor specific. Alternatively, tetramer-type technology may create 
5 multivalent reagents. 

In a preferred embodiment, the antibodies to bladder cancer protein are capable of 
reducing or eliminating a biological function of a bladder cancer protein, as is described 
below. That is, the addition of anti-bladder cancer protein antibodies (either polyclonal or 
preferably monoclonal) to bladder cancer tissue (or cells containing bladder cancer) may 

10 reduce or eliminate the bladder cancer. Generally, at least about 25% decrease in activity, 
growth, size, or the like is preferred, with at least about 50% being particularly preferred, and 
about a 95-100% decrease being especially preferred. 

In a preferred embodiment the antibodies to the bladder cancer proteins are 
humanized antibodies (e.g., Xenerex Biosciences; Medarex, Inc.; Abgenix, Inc.; Protein 

15 Design Labs, Inc.) Humanized forms of non-human (e.g., murine) antibodies are chimeric 
molecules of immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, 
Fab, Fab', F(ab')2 or other antigen-binding subsequences of antibodies) which contain 
minimal sequence derived from non-human immunoglobulin. Humanized antibodies include 
human immunoglobulins (recipient antibody) in which residues from a complementary 

20 determining region (CDR) of the recipient are replaced by residues from a CDR of a non- 
human species (donor antibody) such as mouse, rat, or rabbit having the desired specificity, 
affinity and capacity. In some instances, Fv framework residues of the human 
immunoglobulin are replaced by corresponding non-human residues. Humanized antibodies 
may also comprise residues which are found neither in the recipient antibody nor in the 

25 imported CDR or framework sequences. In general, a humanized antibody will comprise 
substantially all of at least one, and typically two, variable domains, in which all or 
substantially all of the CDR regions correspond to those of a non-human immunoglobulin 
and all or substantially all of the framework (FR) regions are those of a human 
immunoglobulin consensus sequence. The humanized antibody optimally also will comprise 

30 at least a portion of an immunoglobulin constant region (Fc), typically that of a human 
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immunoglobulin. See Jones, et al. (1986) Nature 321:522-525; Riechmann, et al. (1988) 
Nature 332:323-329; and Presta (1992) Curr. Op. Struct, Biol. 2:593-596. Humanization can 
be performed, e.g., following the method of Winter and co-workers (see Jones, et al. (1986) 
Nature 321 :522-525; Riechmann, et al. (1988) Nature 332:323-327; Verhoeyen, et al. (1988) 
5 Science 239: 1534-1536), by substituting rodent CDRs or CDR sequences for the 

corresponding sequences of a human antibody. Accordingly, such humanized antibodies are 
chimeric antibodies (U.S. Patent No. 4,816,567), wherein substantially less than an intact 
human variable domain has been substituted by the corresponding sequence from a non- 
human species. 

1 0 Human antibodies can also be produced using various techniques known in the art, 

including phage display libraries (Hoogenboom and Winter (1991) J. Mol. Biol. 227:381- 
388; Marks, et al. (1991) J. Mol. Biol. 222:581-597) or of human monoclonal antibodies 
(e.g., p. 77, Cole, et al. in Reisfeld and Sell (1985) Monoclonal Antibodies and Cancer 
Therapy Liss; and Boerner, et al. (1991) J. Immunol. 147:86-95). Similarly, human 

15 antibodies can be made by introducing of human immunoglobulin loci into transgenic 

animals, e.g., mice in which the endogenous immunoglobulin genes have been partially or 
completely inactivated. Upon challenge, human antibody production is observed, which 
closely resembles that seen in humans in all respects, including gene rearrangement, 
assembly, and antibody repertoire. This approach is described, e.g., in U.S. Patent Nos. 

20 5,545,807; 5,545,806; 5,569,825; 5,625,126; 5,633,425; 5,661,016, and Marks, et al. (1992) 
Bio/Technology 10:779-783; Lonberg, et al. (1994) Nature 368:856-859; Morrison (1994) 
Nature 368:812-13; Neuberger (1996) Nature Biotechnology 14:826 commenting on 
Fishwild, et al. (1996) Nature Biotechnology 14:845-51; and Lonberg and Huszar (1995) 
Intern. Rev. Immunol. 13:65-93. 

25 By immunotherapy is meant treatment of bladder cancer with an antibody raised 

against bladder cancer proteins. As used herein, immunotherapy can be passive or active. 
Passive immunotherapy as defined herein is the passive transfer of antibody to a recipient 
(patient), which may be used to target a label or toxin. Active immunization is the induction 
of antibody and/or T-cell responses in a recipient (patient). Induction of an immune response 

30 is the result of providing the recipient with an antigen to which antibodies are raised. As 
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appreciated by one of ordinary skill in the art, the antigen may be provided by injecting a 
polypeptide against which antibodies are desired to be raised into a recipient, or contacting 
the recipient with a nucleic acid capable of expressing the antigen and under conditions for 
expression of the antigen, leading to an immune response. 
5 In a preferred embodiment the bladder cancer proteins against which antibodies are 

raised are secreted proteins as described above. Without being bound by theory, antibodies 
used for treatment, bind and prevent the secreted protein from binding to its receptor, thereby 
inactivating the secreted bladder cancer protein. 

In another preferred embodiment, the bladder cancer protein to which antibodies are 

10 raised is a transmembrane protein. Without being bound by theory, antibodies used for 
treatment, bind the extracellular domain of the bladder cancer protein and prevent it from 
binding to other proteins, such as circulating ligands or cell-associated molecules. The 
antibody may cause down-regulation of the transmembrane bladder cancer protein. As will 
be appreciated by one of ordinary skill in the art, the antibody may be a competitive, non- 

1 5 competitive or uncompetitive inhibitor of protein binding to the extracellular domain of the 
bladder cancer protein. The antibody is also an antagonist of the bladder cancer protein. 
Further, the antibody prevents activation of the transmembrane bladder cancer protein. In 
one aspect, when the antibody prevents the binding of other molecules to the bladder cancer 
protein, the antibody prevents growth of the cell. The antibody may also be used to target or 

20 sensitize the cell to cytotoxic agents, including, but not limited to TNF-oc, TNF-P, IL-1, INF-y 
and IL-2, or chemotherapeutic agents including 5FU, vinblastine, actinomycin D, cisplatin, 
methotrexate, and the like. In some instances the antibody belongs to a sub-type that 
activates serum complement when complexed with the transmembrane protein thereby 
mediating cytotoxicity or antigen-dependent cytotoxicity (ADCC). Thus, bladder cancer is 

25 treated by administering to a patient antibodies directed against the transmembrane bladder 
cancer protein. Antibody-labeling may activate a co-toxin, localize a toxin payload, or 
otherwise provide means to locally ablate cells. 

In another preferred embodiment, the antibody is conjugated to an effector moiety. 
The effector moiety can be a number of molecules, including labelling moieties such as 

30 radioactive labels or fluorescent labels, or can be a therapeutic moiety. In one aspect the 
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therapeutic moiety is a small molecule that modulates the activity of the bladder cancer 
protein. In another aspect the therapeutic moiety modulates the activity of molecules 
associated with or in close proximity to the bladder cancer protein. The therapeutic moiety 
may inhibit enzymatic activity such as protease or collagenase or protein kinase activity 
5 associated with bladder cancer. 

In a preferred embodiment, the therapeutic moiety can also be a cytotoxic agent. In 
this method, targeting the cytotoxic agent to bladder cancer tissue or cells, results in a 
reduction in the number of afflicted cells, thereby reducing symptoms associated with bladder 
cancer. Cytotoxic agents are numerous and varied and include, but are not limited to, 

1 0 cytotoxic drugs or toxins or active fragments of such toxins. Suitable toxins and their 

corresponding fragments include diphtheria A chain, exotoxin A chain, ricin A chain, abrin A 
chain, curcin, crotin, phenomycin, enomycin, and the like. Cytotoxic agents also include 
radiochemicals made by conjugating radioisotopes to antibodies raised against bladder cancer 
proteins, or binding of a radionuclide to a chelating agent that has been covalently attached to 

1 5 the antibody. Targeting the therapeutic moiety to transmembrane bladder cancer proteins not 
only serves to increase the local concentration of therapeutic moiety in the bladder cancer 
afflicted area, but also serves to reduce deleterious side effects that may be associated with 
the therapeutic moiety. 

In another preferred embodiment, the bladder cancer protein against which the 

20 antibodies are raised is an intracellular protein. In this case, the antibody may be conjugated 
to a protein which facilitates entry into the cell. In one case, the antibody enters the cell by 
endocytosis. In another embodiment, a nucleic acid encoding the antibody is administered to 
the individual or cell. Moreover, wherein the bladder cancer protein can be targeted within a 
cell, e.g., the nucleus, an antibody thereto contains a signal for that target localization, e.g., a 

25 nuclear localization signal. 

The bladder cancer antibodies of the invention specifically bind to bladder cancer 
proteins. By "specifically bind" herein is meant that the antibodies bind to the protein with a 
IQj of at least about 0.1 mM, more usually at least about 1 ^M, preferably at least about 0.1 

(iM or better, and most preferably, 0.01 |iM or better. Selectivity of binding is also 
30 important. 
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Detection of bladder cancer sequence for diagnostic and therapeutic applications 

In one aspect, the RNA expression levels of genes are determined for different 
cellular states in the bladder cancer phenotype. Expression levels of genes in normal tissue 
5 (e.g., not experiencing bladder cancer) and in bladder cancer tissue (and in some cases, for 
varying severities of bladder cancer that relate to prognosis, as outlined below), or in non- 
malignant disease, are evaluated to provide expression profiles. An expression profile of a 
particular cell state or point of development is essentially a "fingerprint" of the state. While 
two states may have a particular gene similarly expressed, the evaluation of a number of 

10 genes simultaneously allows the generation of a gene expression profile that is reflective of 
the state of the cell. By comparing expression profiles of cells in different states, information 
regarding which genes are important (including both up- and down-regulation of genes) in 
each of these states is obtained. Then, diagnosis may be performed or confirmed to 
determine whether a tissue sample has the gene expression profile of normal or cancerous 

1 5 tissue. This will provide for molecular diagnosis of related conditions. 

"Differential expression," or grammatical equivalents as used herein, refers to 
qualitative or quantitative differences in the temporal and/or cellular gene expression 
patterns within and among cells and tissue. Thus, a differentially expressed gene can 
qualitatively have its expression altered, including an activation or inactivation, in, e.g., 

20 normal versus bladder cancer tissue. Genes may be turned on or turned off in a particular 
state, relative to another state thus permitting comparison of two or more states. A 
qualitatively regulated gene will exhibit an expression pattern within a state or cell type 
which is detectable by standard techniques. Some genes will be expressed in one state or cell 
type, but not in both. Alternatively, the difference in expression maybe quantitative, e.g., in 

25 that expression is increased or decreased; e.g., gene expression is either upregulated, resulting 
in an increased amount of transcript, or downregulated, resulting in a decreased amount of 
transcript. The degree to which expression differs need only be large enough to quantify via 
standard characterization techniques as outlined below, such as by use of Affymetrix 
GeneChip™ expression arrays. See Lockhart (1996) Nature Biotechnology 14:1675-1680. 

30 Other techniques include, but are not limited to, quantitative reverse transcriptase PCR, 
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northern analysis and RNase protection. As outlined above, preferably the change in 
expression (e.g., upregulation or downregulation) is at least about 50%, more preferably at 
least about 100%, more preferably at least about 150%, more preferably at least about 200%, 
with from about 300-1000% being especially preferred. 
5 Evaluation may be at the gene transcript, or the protein level. The amount of gene 

expression may be monitored using nucleic acid probes to the DNA or RNA equivalent of the 
gene transcript, and the quantification of gene expression levels, or, alternatively, the final 
gene product itself (protein) can be monitored, e.g., with antibodies to the bladder cancer 
protein and standard immunoassays (ELISAs, etc.) or other techniques, including mass 

10 spectroscopy assays, 2D gel electrophoresis assays, etc. Proteins corresponding to bladder 
cancer genes, e.g., those identified as being important in a bladder cancer or disease 
phenotype, can be evaluated in a bladder cancer diagnostic test. 

In a preferred embodiment, gene expression monitoring is performed simultaneously 
on a number of genes. Multiple protein expression monitoring can be performed as well. 

15 Similarly, these assays may be performed on an individual basis as well. 

In this embodiment, the bladder cancer nucleic acid probes are attached to biochips as 
outlined herein for the detection and quantification of bladder cancer sequences in a 
particular cell. The assays are further described below in the example. PCR techniques can 
be used to provide greater sensitivity. 

20 In a preferred embodiment nucleic acids encoding the bladder cancer protein are 

detected. Although DNA or RNA encoding the bladder cancer protein may be detected, of 
particular interest are methods wherein an mRNA encoding a bladder cancer protein is 
detected. Probes to detect mRNA can be a nucleotide/deoxynucleotide probe that is 
complementary to and hybridizes with the mRNA and includes, but is not limited to, 

25 oligonucleotides, cDNA or RNA. Probes also should contain a detectable label, as defined 
herein. In one method the mRNA is detected after immobilizing the nucleic acid to be 
examined on a solid support such as nylon membranes and hybridizing the probe with the 
sample. Following washing to remove the non-specifically bound probe, the label is 
detected. In another method detection of the mRNA is performed in situ. In this method 

30 permeabilized cells or tissue samples are contacted with a detectably labeled nucleic acid 
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probe for sufficient time to allow the probe to hybridize with the target mRNA. Following 
washing to remove the non-specifically bound probe, the label is detected. For example, a 
digoxygenin labeled riboprobe (RNA probe) that is complementary to the mRNA encoding a 
bladder cancer protein is detected by binding the digoxygenin with an anti-digoxygenin 
5 secondary antibody and developed with nitro blue tetrazolium and 5-bromo-4-chloro-3- 
indoyl phosphate. 

In a preferred embodiment, various proteins from the three classes of proteins as 
described herein (secreted, transmembrane or intracellular proteins) are used in diagnostic 
assays. The bladder cancer proteins, antibodies, nucleic acids, modified proteins and cells 
10 containing bladder cancer sequences are used in diagnostic assays. This can be performed on 
an individual gene or corresponding polypeptide level. In a preferred embodiment, the 
expression profiles are used, preferably in conjunction with high throughput screening 
techniques to allow monitoring for expression profile genes and/or corresponding 
polypeptides. 

15 As described and defined herein, bladder cancer proteins, including intracellular, 

transmembrane or secreted proteins, find use as diagnostic or prognostic markers of bladder 
cancer, or to assist in selecting therpay based on expression profile and archival data. 
Detection of these proteins in putative bladder cancer tissue allows for detection or diagnosis 
of bladder cancer. In one embodiment, antibodies are used to detect bladder cancer proteins. 

20 A preferred method separates proteins from a sample by electrophoresis on a gel (typically a 
denaturing and reducing protein gel, but may be another type of gel, including isoelectric 
focusing gels and the like). Following separation of proteins, the bladder cancer protein is 
detected, e.g., by immunoblotting with antibodies raised against the bladder cancer protein. 
Methods of immunoblotting are well known to those of ordinary skill in the art. 

25 In another preferred method, antibodies to the bladder cancer protein find use in in 

situ imaging techniques, e.g., in histology (e.g., Asai (ed. 1993) "Antibodies in Cell Biology" 
Methods in Cell Biology (vol. 37). In this method cells are contacted with from one to many 
antibodies to the bladder cancer protein(s). Following washing to remove non-specific 
antibody binding, the presence of the antibody or antibodies is detected. In one embodiment 

30 the antibody is detected by incubating with a secondary antibody that contains a detectable 
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label. In another method the primary antibody to the bladder cancer protein(s) contains a 
detectable label, e.g. an enzyme marker that can act on a substrate. In another preferred 
embodiment each one of multiple primary antibodies contains a distinct and detectable label. 
This method finds particular use in simultaneous screening for a plurality of bladder cancer 
5 proteins. As will be appreciated by one of ordinary skill in the art, many other histological 
imaging techniques are also provided by the invention. 

In a preferred embodiment the label is detected in a fluorometer which has the ability 
to detect and distinguish emissions of different wavelengths. In addition, a fluorescence 
activated cell sorter (FACS) can be used in the method. 

10 In another preferred embodiment, antibodies find use in diagnosing bladder cancer 

from blood, serum, plasma, stool, urine, and other samples. Such samples, therefore, are 
useful as samples to be probed or tested for the presence of bladder cancer proteins. 
Antibodies can be used to detect a bladder cancer protein by previously described 
immunoassay techniques including ELIS A, immunoblotting (western blotting), 

15 immunoprecipitation, BIACORE technology and the like. Conversely, the presence of 

antibodies may indicate an immune response against an endogenous bladder cancer protein. 

In a preferred embodiment, in situ hybridization of labeled bladder cancer nucleic 
acid probes to tissue arrays is done. For example, arrays of tissue samples, including bladder 
cancer tissue and/or normal tissue, are made. In situ hybridization (see, e.g., Ausubel, supra) 

20 is then performed. When comparing the fingerprints between an individual and a standard, 
the skilled artisan can make a diagnosis, a prognosis, or a prediction based on the findings. It 
is further understood that the genes which indicate the diagnosis may differ from those which 
indicate the prognosis and molecular profiling of the condition of the cells may lead to 
distinctions between responsive or refractory conditions or may be predictive of outcomes. 

25 In a preferred embodiment, the bladder cancer proteins, antibodies, nucleic acids, 

modified proteins and cells containing bladder cancer sequences are used in prognosis assays. 
As above, gene expression profiles can be generated that correlate to bladder cancer, clinical, 
pathological, or other information, e.g., in terms of long term prognosis. Again, this may be 
done on either a protein or gene level, with the use of genes being preferred. Single or 

30 multiple genes may be useful in various combinations. As above, bladder cancer probes may 
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be attached to biochips for the detection and quantification of bladder cancer sequences in a 
tissue or patient. The assays proceed as outlined above for diagnosis. PCR methods may 
provide more sensitive and accurate quantification. 

5 Assays for therapeutic compounds 

In a preferred embodiment members of the proteins, nucleic acids, and antibodies as 
described herein are used in drug screening assays. The bladder cancer proteins, antibodies, 
nucleic acids, modified proteins and cells containing bladder cancer sequences are used in 
drug screening assays or by evaluating the effect of drug candidates on a "gene expression 

10 profile" or expression profile of polypeptides. In a preferred embodiment, the expression 
profiles are used, preferably in conjunction with high throughput screening techniques to 
allow monitoring for expression profile genes after treatment with a candidate agent. See, 
e.g., Zlokarnik, et al. (1998) Science 279:84-88; and Heid (1996) Genome Res. 6:986-94. 
In a preferred embodiment, the bladder cancer proteins, antibodies, nucleic acids, 

1 5 modified proteins and cells containing the native or modified bladder cancer proteins are used 
in screening assays. That is, the present invention provides novel methods for screening for 
compositions which modulate the bladder cancer phenotype or an identified physiological 
function of a bladder cancer protein. As above, this can be done on an individual gene level 
or by evaluating the effect of drug candidates on a "gene expression profile". In a preferred 

20 embodiment, the expression profiles are used, preferably in conjunction with high throughput 
screening techniques to allow monitoring for expression profile genes after treatment with a 
candidate agent, see Zlokarnik, supra. 

Having identified the differentially expressed genes herein, a variety of assays may be 
executed. In a preferred embodiment, assays may be run on an individual gene or protein 

25 level. That is, having identified a particular gene as up regulated in bladder cancer, test 

compounds can be screened for the ability to modulate gene expression or for binding to the 
bladder cancer protein. "Modulation" thus includes both an increase and a decrease in gene 
expression. The preferred amount of modulation will depend on the original change of the 
gene expression in normal versus tissue undergoing bladder cancer, with changes of at least 

30 about 10%, preferably about 50%, more preferably about 100-300%, and in some 
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embodiments about 300-1000% or greater. Thus, if a gene exhibits about 4-fold increase in 
bladder cancer tissue compared to normal tissue, a decrease of about four-fold is often 
desired; similarly, about 10-fold decrease in bladder cancer tissue compared to normal tissue 
often provides a target value of about 10-fold increase in expression to be induced by the test 
5 compound. 

The amount of gene expression may be monitored using nucleic acid probes and the 
quantification of gene expression levels, or, alternatively, the gene product itself can be 
monitored, e.g., through the use of antibodies to the bladder cancer protein and standard 
immunoassays. Proteomics and separation techniques may also allow quantification of 
10 expression. 

In a preferred embodiment, gene expression or protein monitoring of a number of 
entities, e.g., an expression profile, is monitored simultaneously. Such profiles will typically 
involve a plurality of those entities described herein.. 

In this embodiment, the bladder cancer nucleic acid probes are attached to biochips as 
1 5 outlined herein for the detection and quantification of bladder cancer sequences in a 

particular cell. Alternatively, PCR may be used. Thus, a series, e.g., of microtiter plate, may 
be used with dispensed primers in desired wells. A PCR reaction can then be performed and 
analyzed for each well. 

Expression monitoring can be performed to identify compounds that modify the 
20 expression of one or more bladder cancer-associated sequences, e.g., a polynucleotide 

sequence set out inTables 1 A-13. Generally, in a preferred embodiment, a test modulator is 
added to the cells prior to analysis. Moreover, screens are also provided to identify agents 
that modulate bladder cancer, modulate bladder cancer proteins, bind to a bladder cancer 
protein, or interfere with the binding of a bladder cancer protein and an antibody or other 
25 binding partner. 

The term "test compound" or "drug candidate" or "modulator" or grammatical 
equivalents as used herein describes a molecule, e.g., protein, oligopeptide, small organic 
molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or 
indirectly alter the bladder cancer phenotype or the expression of a bladder cancer sequence, 
30 e.g., a nucleic acid or protein sequence. In preferred embodiments, modulators alter 
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expression profiles, or expression profile nucleic acids or proteins provided herein. In one 
embodiment, the modulator suppresses a bladder cancer phenotype, e.g., to a normal tissue or 
non-malignant fingerprint. In another embodiment, a modulator induced a bladder cancer 
phenotype. Generally, a plurality of assay mixtures are run in parallel with different agent 
5 concentrations to obtain a differential response to the various concentrations. Typically, one 
of these concentrations serves as a negative control, e.g., at zero concentration or below the 
level of detection. 

Drug candidates encompass numerous chemical classes, though typically they are 
organic molecules, preferably small organic compounds having a molecular weight of more 

10 than about 100 and less than about 2,500 daltons. Preferred small molecules are less than 
about 2000, or less than about 1500 or less than about 1000 or less than about 500 D. 
Candidate agents comprise functional groups necessary for structural interaction with 
proteins, particularly hydrogen bonding, and typically include at least an amine, carbonyl, 
hydroxy], or carboxyl group, preferably at least two of the functional chemical groups. The 

15 candidate agents often comprise cyclical carbon or heterocyclic structures and/or aromatic or 
polyaromatic structures substituted with one or more of the above functional groups. 
Candidate agents are also found among biomolecules including peptides, saccharides, fatty 
acids, steroids, purines, pyrimidines, derivatives, structural analogs or combinations thereof. 
Particularly preferred are peptides. 

20 In one aspect, a modulator will neutralize the effect of a bladder cancer protein. By 

"neutralize" is meant that activity of a protein is inhibited or blocked and the consequent 
effect on the cell. 

In certain embodiments, combinatorial libraries of potential modulators will be 
screened for an ability to bind to a bladder cancer polypeptide or to modulate activity. 
25 Conventionally, new chemical entities with useful properties are generated by identifying a 
chemical compound (called a "lead compound") with some desirable property or activity, 
e.g., inhibiting activity, creating variants of the lead compound, and evaluating the property 
and activity of those variant compounds. Often, high throughput screening (HTS) methods 
are employed for such an analysis. 
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In one preferred embodiment, high throughput screening methods involve providing a 
library containing a large number of potential therapeutic compounds (candidate 
compounds). Such "combinatorial chemical libraries" are then screened in one or more 
assays to identify those library members (particular chemical species or subclasses) that 
5 display a desired characteristic activity. The compounds thus identified can serve as 

conventional "lead compounds" or can themselves be used as potential or actual therapeutics. 

A combinatorial chemical library is a collection of diverse chemical compounds 
generated by either chemical synthesis or biological synthesis by combining a number of 
chemical "building blocks" such as reagents. For example, a linear combinatorial chemical 

10 library, such as a polypeptide (e.g., mutein) library, is formed by combining a set of chemical 
building blocks called amino acids in every possible way for a given compound length (e.g., 
the number of amino acids in a polypeptide compound). Millions of chemical compounds 
can be synthesized through such combinatorial mixing of chemical building blocks. See, e.g., 
Gallop, et al. (1994) J. Med. Chem. 37:1233-1251. 

15 Preparation and screening of combinatorial chemical libraries is well known to those 

of skill in the art. Such combinatorial chemical libraries include, but are not limited to, 
peptide libraries (see, e.g., U.S. Patent No. 5,010,175, Furka (1991) Pent. Prot. Res. 37:487- 
493, Houghton, et al. (1991) Nature 354:84-88); peptoids (PCT Publication No WO 
91/19735); encoded peptides (PCT Publication WO 93/20242); random bio-oligomers (PCT 

20 Publication WO 92/00091); benzodiazepines (U.S. Pat. No. 5,288,514); diversomers such as 
hydantoins, benzodiazepines, and dipeptides (Hobbs, et al. (1993) Proc. Nat'l Acad. Sci. USA 
90:6909-6913); vinylogous polypeptides (Hagihara, et al. (1992) J. Amer. Chem. Soc. 
1 14:6568-6570); nonpeptidal peptidomimetics with a Beta-D-Glucose scaffolding 
(Hirschmann, et al. (1992) J. Amer. Chem. Soc. 1 14:9217-9218); analogous organic 

25 syntheses of small compound libraries (Chen, et al. (1994) J. Amer. Chem. Soc. 1 16:2661- 
2662); oligocarbamates (Cho, et al. (1993) Science 261:1303-1305); and/or peptidyl 
phosphonates (Campbell, et al. (1994) J. Org. Chem. 59:658-xxx). See, generally, Gordon, et 
al. (1994) J. Med. Chem. 37:1385-1401, nucleic acid libraries (see, e.g., Strategene, Corp.); 
peptide nucleic acid libraries (see, e.g., U.S. Patent 5,539,083); antibody libraries (see, e.g., 

30 Vaughn, et al. (1996) Nature Biotechnology 14:309-314, and PCT/US96/10287); 
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carbohydrate libraries (Liang, et al. (1996) Science 274:1520-1522, and U.S. Patent No. 
5,593,853); and small organic molecule libraries (see, e.g., benzodiazepines, Baum (p. 33, 
Jan 18, 1993) C&ENews) : isoprenoid (U.S. Patent No. 5,569,588); thiazolidinones and 
metathiazanones (U.S. Patent No. 5,549,974); pyrrolidines (U.S. Patent Nos. 5,525,735 and 
5 5,519,134); morpholino compounds (U.S. Patent No. 5,506,337); benzodiazepines (U.S. 
Patent No. 5,288,514); and the like. 

Devices for the preparation of combinatorial libraries are commercially available. 
See, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY; Symphony, Rainin, 
Woburn, MA; 433A Applied Biosystems, Foster City, CA; and 9050 Plus, Millipore, 
10 Bedford, MA. 

A number of well known robotic systems have also been developed for solution phase 
chemistries. These systems include automated workstations like the automated synthesis 
apparatus developed by Takeda Chemical Industries, LTD. (Osaka, Japan) and many robotic 
systems utilizing robotic arms (Zymate n, Zymark Corporation, Hopkinton, MA; Orca, 

15 Hewlett-Packard, Palo Alto, CA), which mimic the manual synthetic operations performed by 
a chemist. The above devices are suitable for use with the present invention. The nature and 
implementation of modifications to these devices (if any) so that they can operate as 
discussed herein will be apparent to persons skilled in the relevant art. In addition, numerous 
combinatorial libraries are themselves commercially available (see, e.g., ComGenex, 

20 Princeton, NJ; Asinex, Moscow, Ru; Tripos, Inc., St. Louis, MO; ChemStar, Ltd, Moscow, 
RU; 3D Pharmaceuticals, Exton, PA; Martek Biosciences, Columbia, MD, etc.). 

The assays to identify modulators are amenable to high throughput screening. 
Preferred assays thus detect enhancement or inhibition of bladder cancer gene transcription, 
inhibition or enhancement of polypeptide expression, and inhibition or enhancement of 

25 polypeptide activity. 

High throughput assays for the presence, absence, quantification, or other properties 
of particular nucleic acids or protein products are well known to those of skill in the art. 
Similarly, binding assays and reporter gene assays are similarly well known. Thus, e.g., U.S. 
Patent No. 5,559,410 discloses high throughput screening methods for proteins, U.S. Patent 

30 No. 5,585,639 discloses high throughput screening methods for nucleic acid binding (e.g., in 
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arrays), while U.S. Patent Nos. 5,576,220 and 5,541,061 disclose high throughput methods of 
screening for ligand/antibody binding. 

In addition, high throughput screening systems are commercially available (see, e.g., 
Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman 
5 Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA; etc.). These systems 
typically automate entire procedures, including all sample and reagent pipetting, liquid 
dispensing, timed incubations, and final readings of the microplate in detector(s) appropriate 
for the assay. These configurable systems provide high throughput and rapid start up as well 
as a high degree of flexibility and customization. The manufacturers of such systems provide 
10 detailed protocols for various high throughput systems. Thus, e.g., Zymark Corp. provides 
technical bulletins describing screening systems for detecting the modulation of gene 
transcription, ligand binding, and the like. 

In one embodiment, modulators are proteins, often naturally occurring proteins or 
fragments of naturally occurring proteins. Thus, e.g., cellular extracts containing proteins, or 
1 5 random or directed digests of proteinaceous cellular extracts, may be used. In this way 

libraries of proteins may be made for screening in the methods of the invention. Particularly 
preferred in this embodiment are libraries of bacterial, fungal, viral, and mammalian proteins, 
with the latter being preferred, and human proteins being especially preferred. Particularly 
usefiiil test compound will be directed to the class of proteins to which the target belongs, e.g., 
20 substrates for enzymes or ligands and receptors. 

In a preferred embodiment, modulators are peptides of from about 5-30 amino acids, 
with from about 5-20 amino acids being preferred, and from about 7-15 being particularly 
preferred. The peptides may be digests of naturally occurring proteins as is outlined above, 
random peptides, or "biased" random peptides. By "randomized" or grammatical equivalents 
25 herein is meant that each nucleic acid and peptide consists of essentially random nucleotides 
and amino acids, respectively. Since generally these random peptides (or nucleic acids, 
discussed below) are chemically synthesized, they may incorporate nucleotide or amino acid 
substitutions. The synthetic process can be designed to generate randomized proteins or 
nucleic acids, to allow the formation of all or most of the possible combinations over the 
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length of the sequence, thus forming a library of randomized candidate bioactive 
proteinaceous agents. 

In one embodiment, the library is fully randomized, with no sequence preferences or 
constants. In a preferred embodiment, the library is biased. That is, some positions within 
5 the sequence are either held constant, or are selected from a limited number of possibilities. 
For example, in a preferred embodiment, the nucleotides or amino acid residues are 
randomized within a defined class, e.g., of hydrophobic amino acids, hydrophilic residues, 
sterically biased (either small or large) residues, towards the creation of nucleic acid binding 
domains, the creation of cysteines, for cross-linking, prolines for SH-3 domains, serines, 

10 threonines, tyrosines or histidines for phosphorylation sites, etc., or to purines, etc. 

Modulators of bladder cancer can also be nucleic acids, as defined above. 
As described above generally for proteins, nucleic acid modulating agents may be 
naturally occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids. 
Digests of procaryotic or eucaryotic genomes may be used as is outlined above for proteins. 

15 In a preferred embodiment, the candidate compounds are organic chemical moieties, a 

wide variety of which are available in the literature. 

After the candidate agent has been added and the cells allowed to incubate for some 
period of time, the sample containing a target sequence to be analyzed is added to the 
biochip. If required, the target sequence is prepared using known techniques. For example, 

20 the sample may be treated to lyse the cells, using known lysis buffers, electroporation, etc., 
with purification and/or amplification such as PCR performed as appropriate. For example, 
an in vitro transcription with labels covalently attached to the nucleotides is performed. 
Generally, the nucleic acids are labeled with biotin-FITC or PE, or with cy3 or cy5. 

In a preferred embodiment, the target sequence is labeled with, e.g., a fluorescent, a 

25 chemiluminescent, a chemical, or a radioactive signal, to provide a means of detecting the 
target sequence's specific binding to a probe. The label also can be an enzyme, such as, 
alkaline phosphatase or horseradish peroxidase, which when provided with an appropriate 
substrate produces a product that can be detected. Alternatively, the label can be a labeled 
compound or small molecule, such as an enzyme inhibitor, that binds but is not catalyzed or 

30 altered by the enzyme. The label also can be a moiety or compound, such as, an epitope tag 
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or biotin which specifically binds to streptavidin. For the example of biotin, the streptavidin 
is labeled as described above, thereby, providing a detectable signal for the bound target 
sequence. Unbound labeled streptavidin is typically removed prior to analysis. 

As will be appreciated by those in the art, these assays can be direct hybridization 
5 assays or can comprise "sandwich assays", which include the use of multiple probes, as is 
generally outlined in US. Patent Nos. 5,681,702, 5,597,909, 5,545,730, 5,594,1 17, 
5,591,584, 5,571,670, 5,580,731, 5,571,670, 5,591,584, 5,624,802, 5,635,352, 5,594,118, 
5,359,100, 5,124,246 and 5,681,697, all of which are hereby incorporated by reference. In 
this embodiment, in general, the target nucleic acid is prepared as outlined above, and then 

1 0 added to the biochip comprising a plurality of nucleic acid probes, under conditions that 
allow the formation of a hybridization complex. 

A variety of hybridization conditions may be used in the present invention, including 
high, moderate and low stringency conditions as outlined above. The assays are generally 
run under stringency conditions which allows formation of the label probe hybridization 

1 5 complex only in the presence of target. Stringency can be controlled by altering a step 
parameter that is a thermodynamic variable, including, but not limited to, temperature, 
formamide concentration, salt concentration, chaotropic salt concentration pH, organic 
solvent concentration, etc. 

These parameters may also be used to control non-specific binding, as is generally 

20 outlined in U.S. Patent No. 5,681,697. Thus it may be desirable to perform certain steps at 
higher stringency conditions to reduce non-specific binding. 

The reactions outlined herein may be accomplished in a variety of ways. Components 
of the reaction may be added simultaneously, or sequentially, in different orders, with 
preferred embodiments outlined below. In addition, the reaction may include a variety of 

25 other reagents. These include salts, buffers, neutral proteins, e.g., albumin, detergents, etc., 
which may be used to facilitate optimal hybridization and detection, and/or reduce non- 
specific or background interactions. Reagents that otherwise improve the efficiency of the 
assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may also be 
used as appropriate, depending on the sample preparation methods and purity of the target. 
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The assay data are analyzed to determine the expression levels, and changes in 
expression levels as between states, of individual genes, forming a gene expression profile. 

Screens are performed to identify modulators of the bladder cancer phenotype. In one 
embodiment, screening is performed to identify modulators that can induce or suppress a 
5 particular expression profile, thus preferably generating the associated phenotype. In another 
embodiment, e.g., for diagnostic applications, having identified differentially expressed genes 
important in a particular state, screens can be performed to identify modulators that alter 
expression of individual genes. In an another embodiment, screening is performed to identify 
modulators that alter a biological function of the expression product of a differentially 
10 expressed gene. Again, having identified the importance of a gene in a particular state, 

screens are performed to identify agents that bind and/or modulate the biological activity of 
the gene product. 

In addition screens can be done for genes that are induced in response to a candidate 
agent. After identifying a modulator based upon its ability to suppress a bladder cancer 

15 expression pattern leading to a normal expression pattern, or to modulate a single bladder 
cancer gene expression profile so as to mimic the expression of the gene from normal tissue, 
a screen as described above can be performed to identify genes that are specifically 
modulated in response to the agent. Comparing expression profiles between normal tissue 
and agent treated bladder cancer tissue reveals genes that are not expressed in normal tissue 

20 or bladder cancer tissue, but are expressed in agent treated tissue. These agent-specific 

sequences can be identified and used by methods described herein for bladder cancer genes or 
proteins. In particular these sequences and the proteins they encode find use in marking or 
identifying agent treated cells. In addition, antibodies can be raised against the agent induced 
proteins and used to target novel therapeutics to the treated bladder cancer tissue sample. 

25 Thus, in one embodiment, a test compound is administered to a population of bladder 

cancer cells, that have an associated bladder cancer expression profile. By "administration" 
or "contacting" herein is meant that the candidate agent is added to the cells in such a manner 
as to allow the agent to act upon the cell, whether by uptake and intracellular action, or by 
action at the cell surface. In some embodiments, nucleic acid encoding a proteinaceous 

30 candidate agent (e.g., a peptide) may be put into a viral construct such as an adenoviral or 
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retroviral construct, and added to the cell, such that expression of the peptide agent is 
accomplished, e.g., PCT US97/01019. Regulatable gene therapy systems can also be used. 

Once the test compound has been administered to the cells, the cells can be washed if 
desired and are allowed to incubate under preferably physiological conditions for some 
5 period of time. The cells are then harvested and a new gene expression profile is generated, 
as outlined herein. 

Thus, e.g., bladder cancer or non-malignant tissue may be screened for agents that 
modulate, e.g., induce or suppress the bladder cancer phenotype. A change in at least one 
gene, preferably many, of the expression profile indicates that the agent has an effect on 

1 0 bladder cancer activity. By defining such a signature for the bladder cancer phenotype, 

screens for new drugs that alter the phenotype can be devised. With this approach, the drug 
target need not be known and need not be represented in the original expression screening 
platform, nor does the level of transcript for the target protein need to change. 

In a preferred embodiment, as outlined above, screens may be done on individual 

1 5 genes and gene products (proteins). That is, having identified a particular differentially 
expressed gene as important in a particular state, screening of modulators of either the 
expression of the gene or the gene product itself can be done. The gene products of 
differentially expressed genes are sometimes referred to herein as "bladder cancer proteins" 
or a "bladder cancer modulatory protein". The bladder cancer modulatory protein may be a 

20 fragment, or alternatively, be the full length protein to the fragment encoded by the nucleic 
acids of the Tables 1 A-13. Preferably, the bladder cancer modulatory protein is a fragment. 
In a preferred embodiment, the bladder cancer amino acid sequence which is used to 
determine sequence identity or similarity is encoded by a nucleic acid of Tables 1 A-13. In 
another embodiment, the sequences are naturally occurring allelic variants of a protein 

25 encoded by a nucleic acid of Tables 1 A-13. In another embodiment, the sequences are 
sequence variants as further described herein. 

Preferably, the bladder cancer modulatory protein is a fragment of approximately 14 
to 24 amino acids long. More preferably the fragment is a soluble fragment. Preferably, the 
fragment includes a non-transmembrane region. In a preferred embodiment, the fragment has 
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an N-terminal Cys to aid in solubility. In one embodiment, the C-terminus of the fragment is 
kept as a free acid and the N-terminus is a free amine to aid in coupling, e.g., to cysteine. 

In one embodiment the bladder cancer proteins are conjugated to an immunogenic 
agent as discussed herein. In one embodiment the bladder cancer protein is conjugated to 
5 BSA. 

Measurements of bladder cancer polypeptide activity, or of bladder cancer or the 
bladder cancer phenotype can be performed using a variety of assays. For example, the 
effects of the test compounds upon the function of the bladder cancer polypeptides can be 
measured by examining parameters described above. A suitable physiological change that 

1 0 affects activity can be used to assess the influence of a test compound on the polypeptides of 
this invention. When the functional consequences are determined using intact cells or 
animals, one can also measure a variety of effects such as, in the case of bladder cancer 
associated with tumors, tumor growth, tumor metastasis, neovascularization, hormone 
release, transcriptional changes to both known and uncharacterized genetic markers (e.g., 

1 5 northern blots), changes in cell metabolism such as cell growth or pH changes, and changes 
in intracellular second messengers such as cGMP. In the assays of the invention, mammalian 
bladder cancer polypeptide is typically used, e.g., mouse, preferably human. 

Assays to identify compounds with modulating activity can be performed in vitro. 
For example, a bladder cancer polypeptide is first contacted with a potential modulator and 

20 incubated for a suitable amount of time, e.g., from 0.5-48 hours. In one embodiment, the 

bladder cancer polypeptide levels are determined in vitro by measuring the level of protein or 
mRNA. The level of protein is measured using immunoassays such as western blotting, 
ELIS A and the like with an antibody that selectively binds to the bladder cancer polypeptide 
or a fragment thereof. For measurement of mRNA, amplification, e.g., using PCR, LCR, or 

25 hybridization assays, e.g., northern hybridization, RNase protection, dot blotting, are 
preferred. The level of protein or mRNA is detected using directly or indirectly labeled 
detection agents, e.g., fluorescently or radioactively labeled nucleic acids, radioactively or 
enzymatically labeled antibodies, and the like, as described herein. 

Alternatively, a reporter gene system can be devised using the bladder cancer protein 

30 promoter operably linked to a reporter gene such as luciferase, green fluorescent protein, 
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CAT, or P-gal. The reporter construct is typically transfected into a cell. After treatment 
with a potential modulator, the amount of reporter gene transcription, translation, or activity 
is measured according to standard techniques known to those of skill in the art. 

In a preferred embodiment, as outlined above, screens may be done on individual 
5 genes and gene products (proteins). That is, having identified a particular differentially 

expressed gene as important in a particular state, screening of modulators of the expression of 
the gene or the gene product itself can be done. The gene products of differentially expressed 
genes are sometimes referred to herein as "bladder cancer proteins." The bladder cancer 
protein may be a fragment, or alternatively, be the full length protein to a fragment shown 
10 herein. 

In one embodiment, screening for modulators of expression of specific genes is 
performed. Typically, the expression of only one or a few genes are evaluated. In another 
embodiment, screens are designed to first find compounds that bind to differentially 
expressed proteins. These compounds are then evaluated for the ability to modulate 
15 differentially expressed activity. Moreover, once initial candidate compounds are identified, 
variants can be further screened to better evaluate structure activity relationships. 

In a preferred embodiment, binding assays are done. In general, purified or isolated 
gene product is used; that is, the gene products of one or more differentially expressed 
nucleic acids are made. For example, antibodies are generated to the protein gene products, 
20 and standard immunoassays are run to determine the amount of protein present. 

Alternatively, cells comprising the bladder cancer proteins can be used in the assays. 

Thus, in a preferred embodiment, the methods comprise combining a bladder cancer 
protein and a candidate compound, and determining the binding of the compound to the 
bladder cancer protein. Preferred embodiments utilize the human bladder cancer protein, 
25 although other mammalian proteins may also be used, e.g., for the development of animal 
models of human disease. In some embodiments, as outlined herein, variant or derivative 
bladder cancer proteins may be used. 

Generally, in a preferred embodiment of the methods herein, the bladder cancer 
protein or the candidate agent is non-diffusably bound to an insoluble support having isolated 
30 sample receiving areas (e.g., a microtiter plate, an array, etc.). The insoluble supports may be 
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made of a composition to which the compositions can be bound, is readily separated from 
soluble material, and is otherwise compatible with the overall method of screening. The 
surface of such supports may be solid or porous and of a convenient shape. Examples of 
suitable insoluble supports include microtiter plates, arrays, membranes and beads. These are 
5 typically made of glass, plastic (e.g., polystyrene), polysaccharides, nylon or nitrocellulose, 
teflon™, etc. Microtiter plates and arrays are especially convenient because a large number 
of assays can be carried out simultaneously, using small amounts of reagents and samples. 
The particular manner of binding of the composition is not crucial so long as it is compatible 
with the reagents and overall methods of the invention, maintains the activity of the 

10 composition and is nondiffiisable. Preferred methods of binding include the use of antibodies 
(which do not sterically block either the ligand binding site or activation sequence when the 
protein is bound to the support), direct binding to "sticky" or ionic supports, chemical 
crosslinking, the synthesis of the protein or agent on the surface, etc. Following binding of 
the protein or agent, excess unbound material is removed by washing. The sample receiving 

1 5 areas may then be blocked through incubation with bovine serum albumin (BSA), casein or 
other innocuous protein or other moiety. 

In a preferred embodiment, the bladder cancer protein is bound to the support, and a 
test compound is added to the assay. Alternatively, the candidate agent is bound to the 
support and the bladder cancer protein is added. Novel binding agents include specific 

20 antibodies, non-natural binding agents identified in screens of chemical libraries, peptide 

analogs, etc. Of particular interest are screening assays for agents that have a low toxicity for 
human cells. A wide variety of assays may be used for this purpose, including labeled in 
vitro protein-protein binding assays, electrophoretic mobility shift assays, immunoassays for 
protein binding, functional assays (phosphorylation assays, etc.) and the like. 

25 The determination of the binding of the test modulating compound to the bladder 

cancer protein may be done in a number of ways. In a preferred embodiment, the compound 
is labeled, and binding determined directly, e.g., by attaching all or a portion of the bladder 
cancer protein to a solid support, adding a labeled candidate agent (e.g., a fluorescent label), 
washing off excess reagent, and determining whether the label is present on the solid support. 

30 Various blocking and washing steps may be utilized as appropriate. 
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In some embodiments, only one of the components is labeled, e.g., the proteins (or 
proteinaceous candidate compounds) can be labeled. Alternatively, more than one 

component can be labeled with different labels, e.g., 125 I for the proteins and a fluorophor 
for the compound. Proximity reagents, e.g., quenching or energy transfer reagents are also 
5 useful. 

In one embodiment, the binding of the test compound is determined by competitive 
binding assay. The competitor is a binding moiety known to bind to the target molecule (e.g., 
a bladder cancer protein), such as an antibody, peptide, binding partner, ligand, etc. Under 
certain circumstances, there may be competitive binding between the compound and the 

10 binding moiety, with the binding moiety displacing the compound. In one embodiment, the 
test compound is labeled. Either the compound, or the competitor, or both, is added first to 
the protein for a time sufficient to allow binding, if present. Incubations may be performed at 
a temperature which facilitates optimal activity, typically between 4 and 40°C. Incubation 
periods are typically optimized, e.g., to facilitate rapid high throughput screening. Typically 

15 between 0.1 and 1 hour will be sufficient. Excess reagent is generally removed or washed 
away. The second component is then added, and the presence or absence of the labeled 
component is followed, to indicate binding. 

In a preferred embodiment, the competitor is added first, followed by the test 
compound. Displacement of the competitor is an indication that the test compound is binding 

20 to the bladder cancer protein and thus is capable of binding to, and potentially modulating, 
the activity of the bladder cancer protein. In this embodiment, either component can be 
labeled. Thus, e.g., if the competitor is labeled, the presence of label in the wash solution 
indicates displacement by the agent. Alternatively, if the test compound is labeled, the 
presence of the label on the support indicates displacement. 

25 In an alternative embodiment, the test compound is added first, with incubation and 

washing, followed by the competitor. The absence of binding by the competitor may indicate 
that the test compound is bound to the bladder cancer protein with a higher affinity. Thus, if 
the test compound is labeled, the presence of the label on the support, coupled with a lack of 
competitor binding, may indicate that the test compound is capable of binding to the bladder 

30 cancer protein. 
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In a preferred embodiment, the methods comprise differential screening to identity 
agents that are capable of modulating the activity of the bladder cancer proteins. In this 
embodiment, the methods comprise combining a bladder cancer protein and a competitor in a 
first sample. A second sample comprises a test compound, a bladder cancer protein, and a 
5 competitor. The binding of the competitor is determined for both samples, and a change, or 
difference in binding between the two samples indicates the presence of an agent capable of 
binding to the bladder cancer protein and potentially modulating its activity. That is, if the 
binding of the competitor is different in the second sample relative to the first sample, the 
agent is capable of binding to the bladder cancer protein. 

10 Alternatively, differential screening is used to identify drug candidates that bind to the 

native bladder cancer protein, but cannot bind to modified bladder cancer proteins. The 
structure of the bladder cancer protein may be modeled, and used in rational drug design to 
synthesize agents that interact with that site. Drug candidates that affect the activity of a 
bladder cancer protein are also identified by screening drugs for the ability to either enhance 

15 or reduce the activity of the protein. 

Positive controls and negative controls may be used in the assays. Preferably control 
and test samples are performed in at least triplicate to obtain statistically significant results. 
Incubation of all samples is for a time sufficient for the binding of the agent to the protein. 
Following incubation, samples are washed free of non-specifically bound material and the 

20 amount of bound, generally labeled agent determined. For example, where a radiolabel is 
employed, the samples may be counted in a scintillation counter to determine the amount of 
bound compound. 

A variety of other reagents may be included in the screening assays. These include 
reagents like salts, neutral proteins, e.g., albumin, detergents, etc. which may be used to 
25 facilitate optimal protein-protein binding and/or reduce non-specific or background 
interactions. Also reagents that otherwise improve the efficiency of the assay, such as 
protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may be used. The mixture 
of components may be added in an order that provides for the requisite binding. 

In a preferred embodiment, the invention provides methods for screening for a 
30 compound capable of modulating the activity of a bladder cancer protein. The methods 
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comprise adding a test compound, as defined above, to a cell comprising bladder cancer 
proteins. Many different cell types may be transfected to contain a recombinant nucleic acid 
that encodes a bladder cancer protein. In a preferred embodiment, a library of candidate 
agents are tested on a plurality of cells. 
5 In one aspect, the assays are evaluated in the presence or absence or previous or 

subsequent exposure of physiological signals, e.g., hormones, antibodies, peptides, antigens, 
cytokines, growth factors, action potentials, and pharmacological agents including, e.g., 
chemotherapeutics, radiation, carcinogenics, or other cells (e.g., cell-cell contacts). In one 
example, the determinations are determined at different stages of the cell cycle process. 

10 In this way, compounds that modulate bladder cancer agents are identified. 

Compounds with pharmacological activity are able to enhance or interfere with the activity of 
the bladder cancer protein. Once identified, similar structures are evaluated to identify 
critical structural feature of the compound. 

In one embodiment, a method of inhibiting bladder cancer cell division is provided. 

1 5 The method comprises administration of a bladder cancer inhibitor. In another embodiment, 
a method of inhibiting bladder cancer is provided. The method comprises administration of a 
bladder cancer inhibitor. In a further embodiment, methods of treating cells or individuals 
with bladder cancer are provided. The method comprises administration of a bladder cancer 
inhibitor. In one embodiment, a bladder cancer inhibitor is an antibody as discussed above. 

20 In another embodiment, the bladder cancer inhibitor is an antisense molecule. 

A variety of cell growth, proliferation, and metastasis assays are known to those of 
skill in the art, as described below. 
Soft agar growth or colony formation in suspension 

Normal cells require a solid substrate to attach and grow. When the cells are 

25 transformed, they lose this phenotype and grow detached from the substrate. For example, 
transformed cells can grow in stirred suspension culture or suspended in semi-solid media, 
such as semi-solid or soft agar. The transformed cells, when transfected with tumor 
suppressor genes, regenerate normal phenotype and require a solid substrate to attach and 
grow. Soft agar growth or colony formation in suspension assays can be used to identify 

30 modulators of bladder cancer sequences, which when expressed in host cells, inhibit 
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abnormal cellular proliferation and transformation. A therapeutic compound would reduce or 
eliminate the host cells 1 ability to grow in stirred suspension culture or suspended in semi- 
solid media, such as semi-solid or soft. 

Techniques for soft agar growth or colony formation in suspension assays are 
5 described in Freshney ( 1 994) Culture of Animal Cells: A Manual of Basic Technique (3d ed.) 
Wiley-Liss, herein incorporated by reference. See also, the methods section of Garkavtsev, et 
al. (1996), supra, herein incorporated by reference. 
Contact inhibition and density limitation of growth 

Normal cells typically grow in a flat and organized pattern in a petri dish until they 
10 touch other cells. When the cells touch one another, they are contact inhibited and stop 
growing. When cells are transformed, however, the cells are not contact inhibited and 
continue to grow to high densities in disorganized foci. Thus, the transformed cells grow to a 
higher saturation density than normal cells. This can be detected morphologically by the 
formation of a disoriented monolayer of cells or rounded cells in foci within the regular 

15 pattern of normal surrounding cells. Alternatively, labeling index with (3H)-thymidine at 
saturation density can be used to measure density limitation of growth. See Freshney (1994), 
supra. The transformed cells, when transfected with tumor suppressor genes, regenerate a 
normal phenotype and become contact inhibited and would grow to a lower density. 

In this assay, labeling index with (3H)-thymidine at saturation density is a preferred 
20 method of measuring density limitation of growth. Transformed host cells are transfected 
with a bladder cancer-associated sequence and are grown for 24 hours at saturation density in 

non-limiting medium conditions. The percentage of cells labeling with (3H)-thymidine is 

determined autoradiographically. See, Freshney (1994), supra. 

Growth factor or serum dependence 
25 Transformed cells have a lower serum dependence than their normal counterparts 

(see, e.g., Temin (1966) J. Nat'l Cancer Inst. 37:167-175; Eagle, et al. (1970) J. Exp. Med. 

131:836-879; Freshney (1994), supra). This is in part due to release of various growth factors 

by the transformed cells. Growth factor or serum dependence of transformed host cells can 

be compared with that of control. 
30 Tumor specific markers levels 
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Tumor cells release an increased amount of certain factors (hereinafter "tumor 
specific markers") than their normal counterparts. For example, plasminogen activator (PA) 
is released from human glioma at a higher level than from normal brain cells. See, e.g., 
"Angiogenesis, tumor vascularization, and potential interference with tumor growth" pp. 178- 
5 184 in Mihich (ed. 1985) Biological Responses in Cancer Plenum. Similarly, tumor 
angiogenesis factor (TAF) is released at a higher level in tumor cells than their normal 
counterparts. See, e.g., Folkman (1992) Sem Cancer Biol. 3:89-96. 

Various techniques which measure the release of these factors are described in 
Freshney (1994), supra. See also, Unkeless, et al. (1974) J. Biol. Chem. 249:4295-4305; 

10 Strickland and Beers (1976) J. Biol. Chem. 251:5694-5702; Whur, et al (1980) Br. J. Cancer 
42:305-312; Gullino "Angiogenesis, tumor vascularization, and potential interference with 
tumor growth" pp. 178-184 in Mihich (ed. 1985) Biological Responses in Cancer . Plenum; 
and Freshnev ( 1985) Anticancer Res . 5:111-130. 
Invasiveness into Matrigel 

15 The degree of invasiveness into Matrigel or some other extracellular matrix 

constituent can be used as an assay to identify compounds that modulate bladder cancer- 
associated sequences. Tumor cells exhibit a good correlation between malignancy and 
invasiveness of cells into Matrigel or some other extracellular matrix constituent. In this 
assay, tumorigenic cells are typically used as host cells. Expression of a tumor suppressor 

20 gene in these host cells would decrease invasiveness of the host cells. 

Techniques described in Freshney (1984), supra, can be used. Briefly, the level of 
invasion of host cells can be measured by using filters coated with Matrigel or some other 
extracellular matrix constituent. Penetration into the gel, or through to the distal side of the 
filter, is rated as invasiveness, and rated histologically by number of cells and distance 

25 moved, or by prelabeling the cells with 125j and counting the radioactivity on the distal side 
of the filter or bottom of the dish. See, e.g., Freshney (2000), supra. 
Tumor growth in vivo 

Effects of bladder cancer-associated sequences on cell growth can be tested in 
transgenic or immune-suppressed mice. Knock-out transgenic mice can be made, in which 

30 the bladder cancer gene is disrupted or in which a bladder cancer gene is inserted. Knock-out 
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transgenic mice can be made by insertion of a marker gene or other heterologous gene into 
the endogenous bladder cancer gene site in the mouse genome via homologous 
recombination. Such mice can also be made by substituting the endogenous bladder cancer 
gene with a mutated version of the bladder cancer gene, or by mutating the endogenous 
5 bladder cancer gene, e.g., by exposure to carcinogens. 

A DNA construct is introduced into the nuclei of embryonic stem cells. Cells 
containing the newly engineered genetic lesion are injected into a host mouse embryo, which 
is re-implanted into a recipient female. Some of these embryos develop into chimeric mice 
that possess germ cells partially derived from the mutant cell line. By breeding the chimeric 

1 0 mice it is possible to obtain a new line of mice containing the introduced genetic lesion. See, 
e.g., Capecchi, et al. (1989) Science 244:1288-1292. Chimeric targeted mice can be made. 
See Hogan, et al. (1988) Manipulating the Mouse Embryo: A Laboratory Manual CSH Press; 
and Robertson (ed. 1987) Teratocarcinomas and Embryonic Stem Cells: A Practical 
Approach IRL Press, Washington, D.C. 

1 5 Alternatively, various immune-suppressed or immune-deficient host animals can be 

used. For example, genetically athymic "nude" mouse (see, e.g., Giovanella, et al. (1974) J. 
Nat'l Cancer Inst. 52:921-930), a SCID mouse, a thymectomized mouse, or an irradiated 
mouse (see, e.g., Bradley, et al. (1978) Br. J. Cancer 38:263-272; Selby, et al. (1980) Br. J. 
Cancer 41 :52-61) can be used as a host. Transplantable tumor cells (typically about 10^ 

20 cells) injected into isogenic hosts will produce invasive tumors in a high proportions of cases, 
while normal cells of similar origin will not. In hosts which developed invasive tumors, cells 
expressing a bladder cancer-associated sequences are injected subcutaneously. After a 
suitable length of time, preferably about 4-8 weeks, tumor growth is measured (e.g., by 
volume or by its two largest dimensions) and compared to the control. Tumors that have 

25 statistically significant reduction (using, e.g., Students T test) are said to have inhibited 
growth. 

Polynucleotide modulators of bladder cancer 

Antisense and RNAi Polynucleotides 
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In certain embodiments, the activity of a bladder cancer-associated protein is down- 
regulated, or entirely inhibited, by the use of antisense polynucleotide, e.g., a nucleic acid 
complementary to, and which can preferably hybridize specifically to, a coding mRNA 
nucleic acid sequence, e.g., a bladder cancer protein mRNA, or a subsequence thereof. 
5 Binding of the antisense polynucleotide to the mRNA reduces the translation and/or stability 
of the mRNA. 

In the context of this invention, antisense polynucleotides can comprise naturally- 
occurring nucleotides, or synthetic species formed from naturally-occurring subunits or their 
close homologs. Antisense polynucleotides may also have altered sugar moieties or inter- 

10 sugar linkages. Exemplary among these are the phosphorothioate and other sulfur containing 
species which are known for use in the art. Analogs are comprehended by this invention so 
long as they function effectively to hybridize with the bladder cancer protein mRNA. See, 
e.g., Isis Pharmaceuticals, Carlsbad, CA; Sequitor, Inc., Natick, MA. 

Such antisense polynucleotides can readily be synthesized using recombinant means, 

1 5 or can be synthesized in vitro. Equipment for such synthesis is sold by several vendors, 
including Applied Biosystems. The preparation of other oligonucleotides such as 
phosphorothioates and alkylated derivatives is also well known. 

Antisense molecules as used herein include antisense or sense oligonucleotides. 
Sense oligonucleotides can, e.g., be employed to block transcription by binding to the anti- 

20 sense strand. The antisense and sense oligonucleotide comprise a single-stranded nucleic 
acid sequence (either RNA or DNA) capable of binding to target mRNA (sense) or DNA 
(antisense) sequences for bladder cancer molecules. A preferred antisense molecule is for a 
bladder cancer sequences in Tables 1A-13, or for a ligand or activator thereof. Antisense or 
sense oligonucleotides, according to the present invention, comprise a fragment generally at 

25 least about 14 nucleotides, preferably from about 14 to 30 nucleotides. The ability to derive 
an antisense or a sense oligonucleotide, based upon a cDNA sequence encoding a given 
protein is described in, e.g., Stein and Cohen (1988) Cancer Res. 48:2659-2668; and van der 
Krol, et al. (1988) BioTechniques 6:958-976. 

RNA interference is a mechanism to suppress gene expression in a sequence specific 

30 manner. See, e.g., Brumelkamp, et al. (2002) Sciencexpress (21March2002); Sharp (1999) 



WO 03/003906 



PCT/US02/21338 



Genes Dev. 13:139-141; and Cathew (200n Curr. Op. Cell BioL 13:244-248. In mammalian 
cells, short, e.g., 21 nt, double stranded small interfering RNAs (siRNA) have been shown to 
be effective at inducing an RNAi response. See, e.g., Elbashir, et al. (2001) Nature 41 1 :494- 
498. The mechanism may be used to downregulate expression levels of identified genes, e.g., 
5 treatment of or validation of relevance to disease 
Ribozvmes 

In addition to antisense polynucleotides, ribozymes can be used to target and inhibit 
transcription of bladder cancer-associated nucleotide sequences. A ribozyme is an RNA 
molecule that catalytically cleaves other RNA molecules. Different kinds of ribozymes have 

10 been described, including group I ribozymes, hammerhead ribozymes, hairpin ribozymes, 
RNase P, and axhead ribozymes. See, e.g., Castanotto, et al. (1994) Adv. in Pharmacology 
25: 289-317 for a general review of the properties of different ribozymes. 

The general features of hairpin ribozymes are described, e.g., in Hampel, et al. (1990) 
Nucl. Acids Res. 18:299-304; European Patent Publication No. 0 360 257; U.S. Patent No. 

15 5,254,678. Methods of preparing them are well known. See, e.g., WO 94/26877; Ojwang, et 
al. (1993) Proc. Nat r l Acad. Sci. USA 90:6340-6344; Yamada, et al. (1994) Human Gene 
Therapy 1:39-45; Leavitt, et al.(1995) Proc. Nat'l Acad. Sci. USA 92:699-703; Leavitt, et al. 
(1994) Human Gene Therapy 5:1151-120; and Yamada, et al. (1994) Virology 205: 121-126. 
Polynucleotide modulators of bladder cancer may be introduced into a cell containing 

20 the target nucleotide sequence by formation of a conjugate with a ligand binding molecule, as 
described in WO 91/04753. Suitable ligand binding molecules include, but are not limited to, 
cell surface receptors, growth factors, other cytokines, or other ligands that bind to cell 
surface receptors. Preferably, conjugation of the ligand binding molecule does not 
substantially interfere with the ability of the ligand binding molecule to bind to its 

25 corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide 
or its conjugated version into the cell. Alternatively, a polynucleotide modulator of bladder 
cancer may be introduced into a cell containing the target nucleic acid sequence, e.g., by 
formation of an polynucleotide-lipid complex, as described in WO 90/10448. It is 
understood that the use of antisense molecules or knock out and knock in models may also be 

30 used in screening assays as discussed above, in addition to methods of treatment 
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Thus, in one embodiment, methods of modulating bladder cancer in cells or 
organisms are provided. In one embodiment, the methods comprise administering to a cell an 
anti-bladder cancer antibody that reduces or eliminates the biological activity of an 
endogenous bladder cancer protein. Alternatively, the methods comprise administering to a 
5 cell or organism a recombinant nucleic acid encoding a bladder cancer protein. This may be 
accomplished in many ways. In a preferred embodiment, e.g., when the bladder cancer 
sequence is down-regulated in bladder cancer, such state may be reversed by increasing the 
amount of bladder cancer gene product in the cell. This can be accomplished, e.g., by 
overexpressing the endogenous bladder cancer gene or administering a gene encoding the 

10 bladder cancer sequence, using known gene-therapy techniques. In a preferred embodiment, 
the gene therapy techniques include the incorporation of the exogenous gene using enhanced 
homologous recombination (EHR), e.g., as described in PCT/US93/03868, hereby 
incorporated by reference in its entirety. Alternatively, e.g., when the bladder cancer 
sequence is up-regulated in bladder cancer, the activity of the endogenous bladder cancer 

15 gene is decreased, e.g., by the administration of a bladder cancer antisense nucleic acid. 

In one embodiment, the bladder cancer proteins of the present invention may be used 
to generate polyclonal and monoclonal antibodies to bladder cancer proteins. Similarly, the 
bladder cancer proteins can be coupled, using standard technology, to affinity 
chromatography columns. These columns may then be used to purify bladder cancer 

20 antibodies useful for production, diagnostic, or therapeutic purposes. In a preferred 

embodiment, the antibodies are generated to epitopes unique to a bladder cancer protein; that 
is, the antibodies show little or no cross-reactivity to other proteins. The bladder cancer 

antibodies may be coupled to standard affinity chromatography columns and used to purify 

i 

bladder cancer proteins. The antibodies may also be used as blocking polypeptides, as 
25 outlined above, since they will specifically bind to the bladder cancer protein. 

Methods of identifying variant bladder cancer-associated sequences 

Without being bound by theory, expression of various bladder cancer sequences is 
correlated with bladder cancer. Accordingly, disorders based on mutant or variant bladder 
30 cancer genes may be determined. In one embodiment, the invention provides methods for 

82 



WO 03/003906 



PCT/US02/21338 



identifying cells containing variant bladder cancer genes, e.g., determining all or part of the 
sequence of at least one endogenous bladder cancer genes in a cell. This may be 
accomplished using many sequencing techniques. In a preferred embodiment, the invention 
provides methods of identifying the bladder cancer genotype of an individual, e.g., 
5 determining all or part of the sequence of at least one bladder cancer gene of the individual. 
This is generally done in at least one tissue of the individual, and may include the evaluation 
of a number of tissues or different samples of the same tissue. The method may include 
comparing the sequence of the sequenced bladder cancer gene to a known bladder cancer 
gene, e.g., a wild-type gene. 

10 The sequence of all or part of the bladder cancer gene can then be compared to the 

sequence of a known bladder cancer gene to determine if differences exist. This can be done 
using many known homology programs, such as Bestfit, etc. In a preferred embodiment, the 
presence of a difference in the sequence between the bladder cancer gene of the patient and 
the known bladder cancer gene correlates with a disease state or a propensity for a disease 

1 5 state, as outlined herein. 

In a preferred embodiment, the bladder cancer genes are used as probes to determine 
the number of copies of the bladder cancer gene in the genome. 

In another preferred embodiment, the bladder cancer genes are used as probes to 
determine the chromosomal localization of the bladder cancer genes. Information such as 

20 chromosomal localization finds use in providing a diagnosis or prognosis in particular when 
chromosomal abnormalities such as translocations, and the like are identified in the bladder 
cancer gene locus. 

Administration of pharmaceutical and vaccine compositions 

25 In one embodiment, a therapeutically effective dose of a bladder cancer protein or 

modulator thereof, is administered to a patient. By "therapeutically effective dose" herein is 
meant a dose that produces effects for which it is administered. The exact dose will depend 
on the purpose of the treatment, and will be ascertainable by one skilled in the art using 
known techniques. See, e.g., Ansel, et al. (1999) Pharmaceutical Dosage Forms and Drug 

30 Delivery Lippincott; Lieberman (1 992) Pharmaceutical Dosage Forms (vols. 1-3) Dekker, 
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ISBN 0824770846, 082476918X, 0824712692, 0824716981; Lloyd (1999) The Art. Science 
and Technology of Pharmaceutical Compounding Amer. Pharma. Assn.; and Pickar (1999) 
Dosage Calculations Thomson. As is known in the art, adjustments for bladder cancer 
degradation, systemic versus localized delivery, and rate of new protease synthesis, as well as 
5 the age, body weight, general health, sex, diet, time of administration, drug interaction and 
the severity of the condition may be necessary, and will be ascertainable with routine 
experimentation by those skilled in the art. USSN 09/687,576, further discloses the use of 
compositions and methods of diagnosis and treatment in bladder cancer and is hereby 
expressly incorporated by reference. 

10 A "patient" for the purposes of the present invention includes both humans and other 

animals, particularly mammals. Thus the methods are applicable to both human therapy and 
veterinary applications. In the preferred embodiment the patient is a mammal, preferably a 
primate, and in the most preferred embodiment the patient is human. 

The administration of the bladder cancer proteins and modulators thereof of the 

1 5 present invention can be done in a variety of ways as discussed above, including, but not 
limited to, orally, subcutaneously, intravenously, intranasally, transdermally, 
intraperitoneally, intramuscularly, intrapulmonary, vaginally, rectally, or intraocularly. In 
some instances, e.g., in the treatment of wounds and inflammation, the bladder cancer 
proteins and modulators may be directly applied as a solution or spray. 

20 The pharmaceutical compositions of the present invention comprise a bladder cancer 

protein in a form suitable for administration to a patient. In the preferred embodiment, the 
pharmaceutical compositions are in a water soluble form, such as being present as 
pharmaceutical^ acceptable salts, which is meant to include both acid and base addition 
salts. "Pharmaceutical^ acceptable acid addition salt" refers to those salts that retain the 

25 biological effectiveness of the free bases and that are not biologically or otherwise 

undesirable, formed with inorganic acids such as hydrochloric. acid, hydrobromic acid, 
sulfuric acid, nitric acid, phosphoric acid and the like, and organic acids such as acetic acid, 
propionic acid, glycolic acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic 
acid, fumaric acid, tartaric acid, citric acid, benzoic acid, cinnamic acid, mandelic acid, 

30 methanesulfonic acid, ethanesulfonic acid, p-toluenesulfonic acid, salicylic acid and the like. 
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"Pharmaceutically acceptable base addition salts" include those derived from inorganic bases 
such as sodium, potassium, lithium, ammonium, calcium, magnesium, iron, zinc, copper, 
manganese, aluminum salts and the like. Particularly preferred are the ammonium, 
potassium, sodium, calcium, and magnesium salts. Salts derived from pharmaceutically 
5 acceptable organic non-toxic bases include salts of primary, secondary, and tertiary amines, 
substituted amines including naturally occurring substituted amines, cyclic amines and basic 
ion exchange resins, such as isopropylamine, trimethylamine, diethylamine, triethylamine, 
tripropylamine, and ethanolamine. 

The pharmaceutical compositions may also include one or more of the following: 

10 carrier proteins such as serum albumin; buffers; fillers such as microcrystalline cellulose, 
lactose, corn and other starches; binding agents; sweeteners and other flavoring agents; 
coloring agents; and polyethylene glycol. 

The pharmaceutical compositions can be administered in a variety of unit dosage 
forms depending upon the method of administration. For example, unit dosage forms 

1 5 suitable for oral administration include, but are not limited to, powder, tablets, pills, capsules 
and lozenges. It is recognized that bladder cancer protein modulators (e.g., antibodies, 
antisense constructs, ribozymes, small organic molecules, etc.) when administered orally, 
should be protected from digestion. This is typically accomplished either by complexing-the 
molecule(s) with a composition to render it resistant to acidic and enzymatic hydrolysis, or by 

20 packaging the molecule(s) in an appropriately resistant carrier, such as a liposome or a 
protection barrier. Means of protecting agents from digestion are well known in the art. 

The compositions for administration will commonly comprise a bladder cancer 
protein modulator dissolved in a pharmaceutically acceptable carrier, preferably an aqueous 
carrier. A variety of aqueous carriers can be used, e.g., buffered saline and the like. These 

25 solutions are sterile and generally free of undesirable matter. These compositions may be 
sterilized by conventional, well known sterilization techniques. The compositions may 
contain pharmaceutically acceptable auxiliary substances as required to approximate 
physiological conditions such as pH adjusting and buffering agents, toxicity adjusting agents 
and the like, e.g., sodium acetate, sodium chloride, potassium chloride, calcium chloride, 

30 sodium lactate and the like. The concentration of active agent in these formulations can vary 
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widely, and will be selected primarily based on fluid volumes, viscosities, body weight and 
the like in accordance with the particular mode of administration selected and the patient's 
needs (e.g., Remington's Pharmaceutical Science (15th ed., 1980) and Hardman and Limbird 
(eds. 2001) Goodman and Gilman: The Pharmacologial Basis of Therapeutics McGraw-Hill. 
5 Thus, a typical pharmaceutical composition for intravenous administration would be 

about 0. 1 - 1 0 mg per patient per day. Dosages from about 0. 1 - 1 00 mg per patient per day 
may be used, particularly when the drug is administered to a secluded site and not into the 
blood stream, such as into a body cavity or into a lumen of an organ. Substantially higher 
dosages are possible in topical administration. Actual methods for preparing parenterally 

10 administrable compositions will be known or apparent to those skilled in the art, e.g., 

Remington's Pharmaceutical Science and Goodman and Gilman: The Pharmacologial Basis 
of Therapeutics, supra. 

The compositions containing modulators of bladder cancer proteins can be 
administered for therapeutic or prophylactic treatments. In therapeutic applications, 

15 compositions are administered to a patient suffering from a disease (e.g., a cancer) in an 
amount sufficient to cure or at least partially arrest the disease and its complications. An 
amount adequate to accomplish this is defined as a "therapeutically effective dose." Amounts 
effective for this use will depend upon the severity of the disease and the general state of the 
patient's health. Single or multiple administrations of the compositions may be administered 

20 depending on the dosage and frequency as required and tolerated by the patient. The 

composition should provide a sufficient quantity of the agents of this invention to effectively 
treat the patient. An amount of modulator that is capable of preventing or slowing the 
development of cancer in a mammal is referred to as a "prophylactically effective dose." The 
particular dose required for a prophylactic treatment will depend upon the medical condition 

25 and history of the mammal, the particular cancer being prevented, as well as other factors 
such as age, weight, gender, administration route, efficiency, etc. Such prophylactic 
treatments may be used, e.g., in a mammal who has previously had cancer to prevent a 
recurrence of the cancer, or in a mammal who is suspected of having a significant likelihood 
of developing cancer based, at least in part, upon gene expression profiles. Vaccine strategies 

30 may be used, in either a DNA vaccine form, or protein vaccine. 
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It will be appreciated that the present bladder cancer protein-modulating compounds 
can be administered alone or in combination with additional bladder cancer modulating 
compounds or with other therapeutic agent, e.g., other anti-cancer agents or treatments. 
In numerous embodiments, one or more nucleic acids, e.g., polynucleotides 
5 comprising nucleic acid sequences set forth in Tables 1 A-13, such as antisense 

polynucleotides or ribozymes, will be introduced into cells, in vitro or in vivo. The present 
invention provides methods, reagents, vectors, and cells useful for expression of bladder 
cancer-associated polypeptides and nucleic acids using in vitro (cell-free), ex vivo, or in vivo 
(cell or organism-based) recombinant expression systems. 

10 The particular procedure used to introduce the nucleic acids into a host cell for 

expression of a protein or nucleic acid is application specific. Many procedures for 
introducing foreign nucleotide sequences into host cells may be used. These include the use 
of calcium phosphate transfection, spheroplasts, electroporation, liposomes, microinjection, 
plasma vectors, viral vectors, and other methods for introducing cloned genomic DNA, 

15 cDNA, synthetic DNA or other foreign genetic material into a host cell. See, e.g., Berger and 
Kimmel (1987) Guide to Molecular Cloning Techniques from Methods in Enzvmology (vol 
152) Academic Press; Ausubel, et al. (eds. 1999 and supplements) Current Protocols in 
Molecular Biology Lippincott; and Sambrook, et al. (1989) Molecular Cloning: A Laboratory 
Manual (2d ed., Vol. 1-3) CSH Press. 

20 In a preferred embodiment, bladder cancer proteins and modulators are administered 

as therapeutic agents, and can be formulated as outlined above. Similarly, bladder cancer 
genes (including both the full-length sequence, partial sequences, or regulatory sequences of 
the bladder cancer coding regions) can be administered in a gene therapy application. These 
bladder cancer genes can include antisense applications, either as gene therapy (e.g., for 

25 incorporation into the genome) or as antisense compositions, as will be appreciated by those 
in the art. 

Bladder cancer polypeptides and polynucleotides can also be administered as vaccine 
compositions to stimulate HTL, CTL, and antibody responses.. Such vaccine compositions 
can include, e.g., lipidated peptides (Vitiello, et al. (1995) J. Clin. Invest. 95:341-349); 
30 peptide compositions encapsulated in poly(DL-lactide-co-glycolide) ("PLG") microspheres 
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(Eldridge, et al. (1991) Molec. Immunol. 28:287-294; Alonso, et al. (1994) Vaccine 12:299- 
306; Jones, et al. (1995) Vaccine 13:675-681); peptide compositions contained in immune 
stimulating complexes (ISCOMS) (Takahashi, et al. (1990) Nature 344:873-875; Hu, et al 
(1998) Clin. Exp. Immunol. 113:235-243); multiple antigen peptide systems (MAPs) (Tarn 
5 (1988) Proc. Nat'l Acad. Sci. USA 85:5409-5413; Tarn (1996) J. Immunol. Methods 196:17- 
32); peptides formulated as multivalent peptides; peptides for use in ballistic delivery 
systems, typically crystallized peptides, viral delivery vectors (Perkus, et al. in Kaufinann 
(ed. 1996) Concepts in Vaccine Development de Gruyter; Chakrabarti, et al. (1986) Nature 
320:535-537; Hu, et al. (1986) Nature 320:537-547; Kieny, et al. (1986) AIDS 

10 Bio/Technology 4:790; Top, et al. (1971) J. Infect. Pis. 124:148-154; Chanda, et al. (1990) 
Virology 175:535-547), particles of viral or synthetic origin (see, e.g., Kofler, et al. (1996) I 
Immunol. Methods 192:25-35; Eldridge, et al. (1993) Sem. Hematol. 30:16-24; Falo, et al. 
(1995) Nature Med. 7:649-653), adjuvants (Warren, et al. (1986) Annu. Rev. Immunol. 
4:369-388; Gupta, et al. (1993) Vaccine 11:293-306), liposomes (Reddy, et al. (1992) L 

15 Immunol. 148:1585-1589; Rock (1996) Immunol. Today 17:131-137), or, naked or particle 
absorbed cDNA (Ulmer, et al. (1993) Science 259:1745-1749; Robinson, et al. (1993) 
Vaccine 11:957-960; Shiver, et al. in Kaufinann (ed. 1996) Concepts in Vaccine 
Development de Gruyter; Cease and Berzofsky (1994) Annu. Rev. Immunol. 12:923-989; 
and Eldridge, et al. (1993) Sem. Hematol. 30:16-24). Toxin-targeted delivery technologies, 

20 also known as receptor mediated targeting, such as those of Avant Immunotherapeutics, Inc., 
Needham, MA, may also be used. 

Vaccine compositions often include adjuvants. Many adjuvants contain a substance 
designed to protect the antigen from rapid catabolism, such as aluminum hydroxide or 
mineral oil, and a stimulator of immune responses, such as lipid A, Bortadella pertussis or 

25 Mycobacterium tuberculosis derived proteins. Certain adjuvants are commercially available 
as, e.g., Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, 
MI); Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline 
Beecham, Philadelphia, PA); aluminum salts such as aluminum hydroxide gel (alum) or 
aluminum phosphate; salts of calcium, iron or zinc; an insoluble suspension of acylated 

30 tyrosine; acylated sugars; cationically or anionically derivatized polysaccharides; 
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polyphosphazenes; biodegradable microspheres; monophosphoryl lipid A and quil A. 
Cytokines, such as GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be 
used as adjuvants. 

Vaccines can be administered as nucleic acid compositions wherein DNA or RNA 
5 encoding one or more of the polypeptides, or a fragment thereof, is administered to a patient. 
This approach is described, for instance, in Wolff, et al. (1990) Science 247:1465-1468 as 
well as U.S. Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,118; 5,736,524; 5,679,647; 
WO 98/04720; and in more detail below. Examples of DNA-based delivery technologies 
include "naked DNA", facilitated (bupivicaine, polymers, peptide-mediated) delivery, 

10 cationic lipid complexes, and particle-mediated ("gene gun") or pressure-mediated delivery 
(see, e.g., U.S. Patent No. 5,922,687). 

For therapeutic or prophylactic immunization purposes, the peptides of the invention 
can be expressed by viral or bacterial vectors. Examples of expression vectors include 
attenuated viral hosts, such as vaccinia or fowlpox. This approach involves the use of 

15 vaccinia virus, e.g., as a vector to express nucleotide sequences that encode bladder cancer 
polypeptides or polypeptide fragments. Upon introduction into a host, the recombinant 
vaccinia virus expresses the immunogenic peptide, and thereby elicits an immune response. 
Vaccinia vectors and methods useful in immunization protocols are described in, e.g., U.S. 
Patent No. 4,722,848. Another vector is BCG (Bacille Calmette Guerin). BCG vectors are 

20 described in Stover, et al. (1 99 1) Nature 351 :456-460. A wide variety of other vectors useful 
for therapeutic administration or immunization, e.g., adeno and adeno-associated virus 
vectors, retroviral vectors, Salmonella typhi vectors, detoxified anthrax toxin vectors, and the 
like. See, e.g., Shata, et al. (2000) Mol Med Today 6:66-71; Shedlock, et al. (2000) L 
Leukoc. Biol. 68:793-806; Hipp, et al. (2000) In Vivo 14:571-85. 

25 Methods for the use of genes as DNA vaccines are well known, and include placing a 

bladder cancer gene or portion of a bladder cancer gene under the control of a regulatable 
promoter or a tissue-specific promoter for expression in a bladder cancer patient. The 
bladder cancer gene used for DNA vaccines can encode full-length bladder cancer proteins, 
but more preferably encodes portions of the bladder cancer proteins including peptides 

30 derived from the bladder cancer protein. In one embodiment, a patient is immunized with a 
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DNA vaccine comprising a plurality of nucleotide sequences derived from a bladder cancer 
gene. For example, bladder cancer-associated genes or sequence encoding subfragments of a 
bladder cancer protein are introduced into expression vectors and tested for their 
immunogenicity in the context of Class I MHC and an ability to generate cytotoxic T cell 
5 responses. This procedure provides for production of cytotoxic T cell responses against cells 
which present antigen, including intracellular epitopes. 

In a preferred embodiment, the DNA vaccines include a gene encoding an adjuvant 
molecule with the DNA vaccine. Such adjuvant molecules include cytokines that increase 
the immunogenic response to the bladder cancer polypeptide encoded by the DNA vaccine. 

10 Additional or alternative adjuvants are available. 

In another preferred embodiment bladder cancer genes find use in generating animal 
models of bladder cancer. When the bladder cancer gene identified is repressed or 
diminished in cancer tissue, gene therapy technology, e.g., wherein antisense RNA directed 
to the bladder cancer gene will also diminish or repress expression of the gene. Animal 

1 5 models of bladder cancer find use in screening for modulators of a bladder cancer-associated 
sequence or modulators of bladder cancer. Similarly, transgenic animal technology including 
gene knockout technology, e.g., as a result of homologous recombination with an appropriate 
gene targeting vector, will result in the absence or increased expression of the bladder cancer 
protein. When desired, tissue-specific expression or knockout of the bladder cancer protein 

20 may be necessary. 

It is also possible that the bladder cancer protein is overexpressed in bladder cancer. 
As such, transgenic animals can be generated that overexpress the bladder cancer protein. 
Depending on the desired expression level, promoters of various strengths can be employed 
to express the transgene. Also, the number of copies of the integrated transgene can be 

25 determined and compared for a determination of the expression level of the transgene. 
Animals generated by such methods find use as animal models of bladder cancer and are 
additionally useful in screening for modulators to treat bladder cancer. 

Kits for Use in Diagnostic and/or Prognostic Applications 

90 



WO 03/003906 



PCT/US02/21338 



For use in diagnostic, research, and therapeutic applications suggested above, kits are 
also provided by the invention. In the diagnostic and research applications such kits may 
include one or more of the following: assay reagents, buffers, bladder cancer-specific nucleic 
acids or antibodies, hybridization probes and/or primers, antisense or inhibitory 
5 polynucleotides, ribozymes, dominant negative bladder cancer polypeptides or 

polynucleotides, small molecules inhibitors of bladder cancer-associated sequences etc. A 
therapeutic product may include sterile saline or another pharmaceutical^ acceptable 
emulsion and suspension base. 

In addition, the kits may include instructional materials containing directions (e.g., 

1 0 protocols) for the practice of the methods of this invention. While the instructional materials 
typically comprise written or printed materials they are not limited to such. A medium 
capable of storing such instructions and communicating them to an end user is contemplated 
by this invention. Such media include, but are not limited to electronic storage media (e.g., 
magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), and the like. Such 

1 5 media may include addresses to internet sites that provide such instructional materials. 

The present invention also provides for kits for screening for modulators of bladder 
cancer-associated sequences. Such kits can be prepared from readily available materials and 
reagents. For example, such kits can comprise one or more of the following materials: a 
bladder cancer-associated polypeptide or polynucleotide, control positive or negative 

20 samples, reaction tubes, and instructions for testing bladder cancer-associated activity. 

Optionally, the kit contains biologically active bladder cancer protein. A wide variety of kits 
and components can be prepared according to the present invention, depending upon the 
intended user of the kit and the particular needs of the user. Diagnosis would typically 
involve evaluation of a plurality of genes or products. The genes will be selected based on 

25 correlations with important parameters in disease which may be identified in historical or 
outcome data. 
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EXAMPLES 

Example 1 : Gene Chip Analysis 

Molecular profiles of various normal and cancerous tissues were determined and analyzed 
using gene chips. RNA was isolated and gene chip analysis was performed as described 
5 (Glynne, et al. (2000) Nature 403:672-676; Zhao, et al. (2000) Genes Dev. 14:981-993). 



TABLE DESCRIPTIONS 
Table 1 A shows about 3413 that exhibit increased or decreased expression in bladder cancer 
10 samples. See USSN 60/302,814. 

Table 2A shows about 485 genes overexpressed in bladder tumors relative to normal tissues 
as analyzed using the Affymetrix/Eos Hu03 GeneChip array. See USSN 60/343,705. 

15 Table 3 A shows about 414 genes upregulated in bladder cancer relative to normal body 
tissues and preferred for utility as small molecule, antibody, DNA vaccine targets for the 
therapy of bladder cancer. These genes were selected from 59680 probesets on the 
Eos/Affymetrix Hu03 Genechip array. Gene expression data for each probeset obtained from 
this analysis was expressed as average intensity (Al), a normalized value reflecting the 

20 relative level of mRNA expression. 

Table 4A shows about 129 genes upregulated in bladder cancer relative to normal body 
tissues and preferred for utility as diagnostics of bladder cancer. These genes were selected 
from 59680 probesets on the Eos/Affymetrix Hu03 Genechip array. Gene expression data for 
25 each probeset obtained from this analysis was expressed as average intensity (Al), a 
normalized value reflecting the relative level of mRNA expression. 

Table 5 A shows about 149 genes upregulated in bladder cancer relative to normal body 
tissues. These genes were selected from 59680 probesets on the Eos/Affymetrix Hu03 
30 Genechip array. Gene expression data for each probeset obtained from this analysis was 

expressed as average intensity (Al), a normalized value reflecting the relative level of mRNA 
expression. 

Table 6 A shows about 199 genes upregulated in bladder cancer relative to normal bladder 
35 tissue. These genes were selected from 59680 probesets on the Eos/Affymetrix Hu03 
Genechip array. Gene expression data for each probeset obtained from this analysis was 
expressed as average intensity (Al), a normalized value reflecting the relative level of mRNA 
expression. 

40 Table 7A shows about 63 genes downregulated in bladder tumors relative to normal bladder. 
These genes were selected from 59680 probesets on the Eos/Affymetrix Hu03 Genechip 
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array. Gene expression data for each probeset obtained from this analysis was expressed as 
average intensity (AI), a normalized value reflecting the relative level of mRNA expression. 

Table 8 A shows about 1440 genes upregulated in Ta or Tl bladder tumors from patients who 
5 later presented with muscle-invasive bladder tumors (stage T2-T4). Gene expression data for 
each probeset obtained from this analysis was expressed as average intensity (AI), a 
normalized value reflecting the relative level of mRNA expression. 

Table 9A shows about 1200 genes upregulated in Ta or Tl tumors of patients who later 
10 presented with either more Ta tumors or no tumors at all. Gene expression data for each 
probeset obtained from this analysis was expressed as average intensity (AI), a normalized 
value reflecting the relative level of mRNA expression. 

Table 10A shows about 65 genes upregulated in non-invasive exophytic Ta bladder tumors 
1 5 relative to ^2^4 muscle-invasive tumors. Gene expression data for each probeset obtained 
from this analysis was expressed as average intensity (AT), a normalized value reflecting the 
relative level of mRNA expression. 

Table 11 A shows about 106 genes upregulated in muscle-invasive T2-T4 bladder tumors 
20 relative to non-invasive exophytic Ta bladder tumors. Gene expression data for each 

probeset obtained from this analysis was expressed as average intensity (AI), a normalized 
value reflecting the relative level of mRNA expression. 

Table 12A shows the Pkey, ExAccn, UnigenelD, and Unigene Title for all of the sequences 
25 in Table 13. Seq ID No. is used to link Table 12A to table 13. 

Tables 1B-12B show the accession numbers for those Pkey's lacking UnigenelD 's for tables 
1 A -12A, respectively. For each probeset is listed a gene cluster number from which 
oligonucleotides were designed. Gene clusters were compiled using sequences derived from 
30 Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity 
using Clustering and Alignment Tools (DoubleTwist, Oakland California). Genbank 
accession numbers for sequences comprising each cluster are listed in the "Accession" 
column. 

35 Tables 1C-12C show genomic positioning for Pkey's lacking Unigene ED's and accession 
numbers for tables 1 A -12 A, respectively. For each predicted exon, is listed genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 



40 TAR) F 1A: 



45 



Pkey: 

ExAccn: 

UnigenelD: 

Unigene Titter 

R1: 

R2: 

Target Type: 



Unique Eos probeset identifier number 

Exemplar Accession number, Genbank accession number 

Unigene number 

Unigene gene title 

please refer to original application 

please refer to original application 

downregulate stage if target is downregulated in bladder tumors relative to normal bladder or 
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early stage if target is an eari stage (Ta) bladder tumor marker or 
late stage if target is a late stage (T2-T4) bladder tumor marker or 
T2-T4 grade 3 papilloma marker or T2-T4 grade 3 solid tumor marker or Upregulate stage 
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Pkey 


ExAccn 


UnigenelD 


Unigene Title 


R1 


R2 


Target Type 


400440 


X83957 


Hs.83870 


nebvtin 


0.17 


Z05 


dovynregulate stage 


400888 








0.24 


1.97 


downregulate stage 


401566 








0.19 


4.06 


downregulate stage 


401669 








0.2 


Z05 


downregulate stage 


401691 








0.04 


10.13 


downregulate stage 


401905 








0.3 


1.87 


downregulate stage 


402076 








0.06 


6.51 


downregulate stage 


402110 








0.43 


2.35 


downregulate stage 


402271 


NMJJ02197 


Hs.1 54721 


aconitase 1, soluble 


0.21 


2.16 


downregulate stage 


403362 






0.18 


4.44 


downregulate stage 


403687 








0.32 


1.91 


downregulate stage 


403959 








0.14 


Z27 


downregulate stage 


404015 








0.2 


2.48 


downregulate stage 


404059 








0.38 


1.84 


downregulate stagB 


404152 








0.32 


1.85 


downregulate stage 


404498 








0.14 


£18 


downregulate stage 


404819 








0.19 


5.25 


downregulate stage 


405001 


U5B196 


Hs.296281 


inlerteukin enhancer binding factor 1 


0.15 


2.92 


downregulate stage 


405349 






0.18 


3.6 


downregutate stage 


405390 








0.3 


Z54 


downregulate stage 


405735 








0.13 


Z44 


downregulate stage 


405968 








0.26 


1.85 


downregulate stage 


406017 








0.32 


Z28 


downregulate stage 


406305 


BE261320 


Hs.297096 


transcriptional adaptor 3 (ADA3, yeast h 


0.42 


1.93 


downregulate stage 


406320 








0.37 


Z01 


downregulate stage 


406704 


M21665 


Hs.929 


myosin, heavy polypeptide 7. cardiac mus 


0.3 


Z84 


downregutate stage 


406706 


X03740 


Hs.231581 


myosin, heavy polypeptide 1, skeletal mu 


0.14 


7.4 


downregulate stage 


406707 


S73840 


Hs.931 


myosin, heavy porypepfide 2, skeletal mu 


0.05 


1Z51 


downregulate stage 


407013 


U35637 




gb:Kuman nebulin mRNA, partial cds 


0.14 


Z17 


downregulate stage 


407245 


X90568 


Hs.172004 


Ufin 


0.02 


15.21 


downregutate stage 


407330 


AA582607 




gb:nn51b05,s1 NCLCGAP_Kid6 Homo sapiens 


0.42 


1.87 


downregulate stage 


407571 


AM46183 


Hs.9572 


ESTs 


0.38 


Z13 


downregulate stage 


407682 


AL035858 


Hs.3807 


FXYO domain-containing ton transport reg 


0.34 


Z56 


downregulate stage 


407815 


AW373860 


Hs.301716 


ESTs 


0.31 


Z44 


downregulate stage 


407834 


AW084991 


Hs.26100 


ESTs 


0.15 


Z98 


downregulate stage 


407891 


AA486620 


Hs.41135 


endomucin-2 


0.15 


3.33 


downregulate stage 


407906 


AA369665 


Hs.41185 


Homo sapiens mRNA; cDNA DKFZp56401262 (f 


0.12 


8.05 


downregulate stage 


407938 


AA905097 


Hs.85050 


phosphoJamban 


0.08 


a77 


downregulate stage 


407965 


W21483 


Hs.41707 


heat shock 27kD protein 3 


0.25 


Z29 


downregulate stage 


408009 


AF020498 


Hs.41735 


purinergfc receptor P2X, Bgand-gated io 


0.49 


1.91 


downregulate stage 


408139 


AA451966 


Hs.43005 


RAB9-Eke protein 


0.41 


1.88 


downregulate stage 


408221 


AA912183 


Hs.47447 


ESTs 


0.04 


24.1 


downregulate stage 


408374 


AW025430 


Hs.1 55591 


fbrkhead box F1 


0.35 


Z85 


downregulate stage 


408493 


BE206854 


Hs.46039 


phosphoglycerate mutase 2 (muscle) 


0.09 


9.04 


downregulate stage 


408508 


AI80B109 


Hs.1 35736 


K1AA1 580 protein 


0.45 


Z2 


downregulate stage 


408614 


AU 37698 


Ks.46531 


Homo sapiens mRNA; cDNA DKFZp434C191 5 (f 


0.13 


5.48 


downregulate stage 


408652 


R43409 


Hs.6829 


ESTs 


0.33 


Z5 


downregulate stage 


408753 


A1337192 


Hs.47438 


SH3 domain binding glutamic acid-rich pr 


0.05 


6.94 


downregulate stage 


408896 


A1610447 


Hs.48778 


niban protein 


0.39 


Z01 


downregulate stage 


409277 


T(S558 


Hs.1 56880 


ESTs 


0.18 


Z75 


downregulate stage 


410023 


AB017169 


Hs.57929 


slit (Orosophila) homoJog 3 


0.07 


Z7 


downregulate stage 


410036 


R57171 


Hs.57975 


calsequestrin 2, cardiac muscle 


0.09 


5.23 


downregulate stage 


410132 


NM_003480 


Hs.58882 


MicrofibrD- associated grycoprotein-2 


0.24 


Z34 


downregulate stage 


410168 


AW834050 


Hs.9973 


lensin 


0.39 


2.17 


downregulate stage 


410243 


Uo34Ui 


HS^OsUUb 


ESTs, Weakly similar to ailematively sp 


U.ll 


Lac 


downregulate stage 


410339 


AI916499 


Hs.298258 


ESTs 


0.15 


Z16 


downregulate stage 


410677 


NMJKJ3278 


Hs.65424 


telranectin (plasmirtogen-blnding protein 


0.32 


Z65 


downregulate stage 


410868 


T06529 


Hs.98518 


Homo sapiens cONA FU 11 490 ns, done HE 


0.2 


Z74 


downregulate stage 


411048 


AK001742 


Hs.67991 


hypothefcal protein DKFZp434G0522 


0.2 


1.92 


downregulate stage 


411067 


AI681006 


Hs.301543 


ESTs 


0.11 


3.41 


downregulate stage 


411069 


AL133092 


Hs.68055 


hypotheBcal protein OKFZp434l0428 


0.17 


5.8 


downregulate stage 


411644 


H92064 


Hs.301548 


ESTs 


0.06 


13.8 


downregulate stage 


411741 


AW859850 




gb:RCO-CT0358-07129941 1-d03 CT0358 Homo 


0.36 


Z5 


downregulate stage 


412047 


AA934589 


Hs.49696 


ESTs 


0.18 


3.57 


downregulate stage 


412095 


AI624707 


Hs.5921 


Homo sapiens cDNA: FU21592 fis, clone C 


0.32 


1.89 


downregulate stage 


412389 


AW947655 




gb:RCQ^0O03-140300-O31-b07 MT0003 Homo 


0.38 


Z6 


downregulate stage 


412442 


AI983730 


Hs.26530 


serum deprivation response (phosphatidyl 


0.12 


3.67 


downregutate stage 


412519 


AA196241 


HSJ3980 


troponin T1. skeletal, slow 


0.24 


1.86 


downregulate stage 


412622 


AW664708 


Hs.171959 


ESTs 


0.05 


5.45 


downregulate stage 


412649 


NM.002206 


Hs.74369 


integrin, alpha 7 


0.29 


Z95 


downregulate stage 


412659 


AW753865 


Hs.74376 


olfactomedin related ER localized protei 


0.18 


Z06 


downregulate stage 


412758 


Y07818 


Hs.74566 


dirr/dropyrimidinase-iike 3 


0.3 


Z23 


downregulate stage 



94 



WO 03/003906 



PCT/US02/21338 



5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



412802 


U41518 


Hs.74602 


aquaporin 1 (channel-forming integral pr 


0.11 


2.71 


downregulate stage 


412975 


T70956 


Hs.75106 


clusterin (complemenl lysis inhibitor, S 


0.44 


2.03 


downregutate stage 


413074 


AI871368 


Hs.B417 


ESTs 


0.47 


1.91 


downregulate stage 


413272 


AA127923 


Hs.293256 


ESTs 


0.09 


4.44 


downregulate stage 


413276 


Z24725 


Hs.75260 


mitogen inducible 2 


0.23 


248 


downreguiatB stage 


413508 


BE145364 




gb:lLr>HT0198-151099*125-e05 HT0198 Homo 


0.31 


2.53 


downregutate stage 


413624 


BE177019 


Hs.75445 


SPARC-like 1 {mast9, hevin) 


0.33 


2.17 


downregulate stage 


41377B 


AA090235 


Hs.75535 


myosin, light polypeptide 2, regulatory, 


0.33 


2.63 


downregulate stage 


414063 


H26904 


Hs.75736 


apoGpoprotein D 


0.42 


1.85 


downregutate stage 


414241 


AA425085 


Hs.4007 


SarcolernmaJ-assoc! aied protein 


0.16 


2.22 


downregulate stage 


414290 


A1568801 


Ks.71721 


ESTs 


0.02 


10 


downreguiatB stage 


414629 


AA345824 


Hs.76688 


carboxy lest erase 1 (rnono^te/macrophaga 


0.13 


4.14 


downregutate stage 


414657 


AA424074 


Hs 76780 


protein phosphatase 1 , regulatory (inhib 


0.33 


2.14 


downregutate stage 


414712 


N88858 


Hs.77039 


ribosomal oroleln S3A 


0.4 


25 


downregulate stage 


414903 


AA451700 


Hs.85835 


Homo sapiens cDNA: FU 22841 fis, done K 


0.3 


3.3 


downregutate stage 


415165 


AW887604 


Hs 78065 


complement component 7 


0.04 


3.41 


downregulate stag b 


415274 


AF001548 


Hs.78344 


myosin, heavy polypeptide 11, smooth mus 


0.2 


3.29 


downregulate stage 


415447 


Z97171 


Hs.78454 


myocflin, trabecular meshwork inducible 


0.15 


6.55 


downregulate stage 


415672 


N53097 


Hs 193579 


ESTs 


0.28 


3.55 


downregulate stage 


415934 


NMJW0928 


Hs.992 


phospholipase A2, group IB (pancreas) 


0.34 


2.64 


downregulate stage 


416127 


N43843 


Hs.79022 


GTP-binding protein overexpressed in ske 


0.3 


1.98 


downregulate stage 


416349 


X69089 


Hs.79227 


myomesin (M-protein) 2{165kD) 


0.41 


1.96 


downregulate stage 


416585 


X54162 


Hs.79386 


letomodin 1 (smooth muscle) 


0.02 


49.3 


downregulate stage 


416854 


H40164 


Hs.80296 


Purkinje cell protein 4 


0.02 


7.55 


downregutate stage 


416941 


BE000150 


Hs.48778 


niban protein 


0.27 


2.16 


downregulate stage 


416982 


J05401 


Hs.80691 


creatine kinase, mitochondrial 2 (sarcom 


0.29 


243 


downregulate stage 


417011 


F08212 


Hs.234898 


ESTs 


0.41 


2.06 


downregulate stage 


417298 


AW565639 


Hs.37958 


ESTs 


0.27 


a7 


downregulate stage 


417501 


AL041219 


Hs.82222 


sema domain, immunoglobulin domain (ig), 


0.39 


2.08 


downregulate stage 


417553 


L09190 


Hs.82276 


trichohyadln 


0.29 


2.59 


downregulate stage 


417987 


AA210872 


Hs.50133 


ESTs 


0.22 


2.09 


downregulate stage 


418297 


R91254 




gb:yp94e1Zs1 Soares fetal Over spleen 


0.28 


1.9 


downregulate stage 


416332 


R34976 


Hs.78293 


ESTs 


0.2 


3.9 


downregulate stage 


418391 


NMJW3281 


Hs.84673 


troponin I, skeletal, slow 


0.35 


202 


downregulate stage 


418409 


AA219332 


Hs,120869 


ESTs, Weakly similar to AF092922 1 ret/n 


0.21 


3.8 


downregulate stags 


418421 


R58620 


Hs 85050 


phos p h olamban 


0.2 


208 


downregulate stage 


418489 


U76421 


Hs.85302 


adenosine deaminase, RNA-spedfic, B1 (h 


0.05 


21.55 


downregutate stage 


418533 


NMJJ04533 


Hs.85937 


myosin-birKJing protein C, fast-type 


0.42 


1.95 


downregulate stage 


418787 


AW296134 


Hs.86999 


ESTs 


0.48 


1.87 


downregulate stage 


418793 


AW382987 


Hs.88474 


prostaglandifvendopeioxide synthase 1 (p 


0.26 


243 


downregutate stage 


418947 


W52990 


Hs.22860 


ESTs 


0.13 


7.18 


downregutate stage 


419037 


R39895 


Hs.7864 


ESTs 


0.27 


2 


rfownreairfafe staoft 


419441 


AW023731 




Hnno sariens mRNA: cDNA QKF7n<iRfil15?4 ff 


0.19 


5.25 


rffwnrpfiiilata slaop 

UvnlMvwUwUP vUSjjC 


419535 


AW1 39550 


Hs.115173 


ESTs 


0.31 


259 


downregutate stage 


419682 


H13139 


Hs.92282 


psirsd^iQcs hocnGorfornsjn tninscnpUon fs 


0.28 


238 


downregutate stage 


419685 


W76083 


Hs 173077 


ESTs 


0.4 


221 


downregulate stage 


419703 


AI793257 

nil wv4wf 


Hs 128151 


ESTs 


0.09 


3.52 


riownreniilatp *5tanp 


419942 


U25138 


Hs.93841 


potassium targe conductance calcium-acti 


0.28 


298 


downregulate stage 


420058 




Hs 94694 


Hnmn cAnrPir; rTlMA Fl M fWfil fla rfnno NT 


0.3 


209 


fViwrtrfimilaffl stano 

KBJ mill tjy U KKU owyo 


420195 


N 44348 


Hs.300794 


ESTs 


0.22 


279 


downregutate stage 


420261 


AW206093 


Hs.748 


fibroblast growth factor receptor 1 (fins 


0.35 


198 


downregulate stage 


420674 


NM.000055 


Hs*1327 


butyrytcbolinesterase 


0.29 


15 


downregulate stage 


421296 


NMJXJ2666 


Hs 1032S3 


rip nli ran 


0.36 


211 


downregulate stage 


421639 


NM.012082 


Hs.297921 


Homo sapiens mRNA full length insert cDM 


0.13 


4.3 


downregutate stage 


421763 


AW1 63500 


Hs.1 08080 


cvsteine and nh/cJns-rich orntein 1 


0.26 


3.49 


downregulate stage 


421853 


AL1 17472 


Hs 108924 


DKFZP586P1422 crotan 


0.14 


5 


downregulate stage 


422103 


AA984330 


Hs.111676 


protein kinase H1 1 ; smaD stress protein 


o!2 


229 


downregulate stage 


422287 


F16365 


Hs.114346 


cytochrome c oxidase subunJi Vila polype 


0.27 


258 


downregulate stage 


422320 


AI745249 


Hs.23650 


ESTs, WeaMy similar to AAB47496 NG5 (H. 


0.24 


295 


downregulate stage 


422633 


X56832 


Hs.1 18804 


enolase 3, (beta, muscle) 


0.23 


3.57 


downregulate stage 


422639 


AI929377 


Hs!l73724 


creatine kinase, brain 


0.39 


1.97 


downregulate stage 


423334 


AK000906 


Hs.1 27273 


hypothetical protein FU10Q44 


0.37 


229 


downregulate stage 


423587 


AA328074 


Hs.284256 


hypothetical protein FU14033 similar b 


0.37 


247 


downregulate stage 


423889 


AL035447 


Hs.1 34594 


hypothetical protein LOC57158 


0.24 


243 


downregulate stage 


424181 


AL039482 


Hs.142517 


Homo sapiens mRNA; cONA DKF2p434P0810 (f 


0.27 


228 


downregulate stage 


424206 


NM 003734 


Hs.198241 


amine oxidase, copper containing 3 (vase 


0.3 


259 


downregulate stage 


424479 


AF0~64238 


Hs.149098 


smootheOn 


0.% 


3.29 


downregulate stage 


424580 


AA446539 


Hs.35092 


ESTs 


0.15 


257 


downregulate stage 


424846 


AU077324 


Hs.1832 


neuropeptide Y 


0.4 


204 


downregulate stage 


424938 


AW102607 


Hs.245233 


ESTs 


0.29 


216 


downregulate stage 


424982 


U94777 


Hs.154084 


phosohorylase, glycogen; muscle (McArdte 


0.42 


1.89 


downregulate stage 


425383 


083407 


Hs.156007 


Down syndrome critical region gene 1-iik 


0.14 


1.86 


downregulate stage 


425545 


N98529 


Hs.158295 


Human mRNA far myosin fighl chain 3 (MIC 


0.03 


13.25 


downregulate stage 


425622 


AW360847 


Hs.16578 


ESTs 


0.3 


219 


downregulate stage 


425751 


T19239 


Hs.1940 


crystallln, alpha B 


0.47 


1.92 


downregulate stage 


425869 


AA524547 


Hs.160318 


FXYD domain-containing ion transport reg 


0.45 


1.85 


downregulate stage 


425887 


AL049443 


Hs.161283 


Homo sapiens mRNA; cDNA DKF2p586M2020 (f 


0.19 


285 


downregutate stage 
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425932 


MB 1650 


Ks.1968 


426354 


NM 004010 


Hs 169470 


426429 


X73114 


Hs 169849 


426752 


X69490 


Hs.172004 


426809 




Hs 29706 


427078 


niof quo* 


Hs.111902 


427136 




Hs 173716 


427164 


ADV0/ / 4 I 


Hs 173R71 
no. 1 / JO/ I 


427185 


AVJQQQ'jn 
HrtOJjQiJOu 


186R74 
ru>. looo/H 


14 1 Jl J 


ARfm7Q79 

ADUU/3/4 


ns. i / / ooo 


4971Q1 

It 103 J 


AR09QH1H 


ns. i / (Ooo 


A07RRR 
44/000 


Ar I040UJ 


He 1RMA1 


A97R7R 

44/0/0 


AA3Q<MK9 
AAO34U04 


MS- 1 0U40O 




A A At 7ftJ?G 
AA4t AJ&O 


Lie 1 Q7COO 


94 /SOU 


AAAIRIftR 
AA4IO0UO 




440UO/ 


AAlyUO/ J 


US. 104441 


^40 1 00 


AA7715M9 
AM/ f 0044 


He 9Q17QQ 


AOQOOi 


1 1QR7Q4 


ns.ioou/o 


440043 


AA440U31 
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409750 AA302840 gb:EST10534 Adipose tissue, white i Homo 

409789 8E256027 Hs.180946 ribosomal protein 15 

409794 AW885691 gb:RC4-OT0071-24030(M)13-b04 OT0071 Homo 

409977 AW805510 Hs.97056 hypothetical protein FU21 634 

5 409985 AW291944 Hs.122139 ESTs 

409989 R37868 Hs.13333 ESTs 

409995 AW960597 Hs.30164 ESTs 

410013 AF067173 Hs.57904 mago-nashi (Drosophfla) homotog, protife 

410044 BE566742 Hs.58169 highly expressed in cancef, rich in leuc 

10 410071 AW582568 gb:RC1-ST027B480100411-hMST0278Homo 

410102 AW248508 Hs.279727 Homo sapiens cDNA FU 14035 fis, done HE 

410114 AW590540 Hs^71280 ESTs 

410117 AK001588 Hs.58650 hypolheCcal protein FU 10724 

t _ 410153 BE311925 Hs.15830 Homo sapiens cDNA FU 12691 fis. clone NT 

15 410181 AI468210 Hs.261285 ptebtroptc regulator 1 (PRL1, Arabktops 

410196 AI936442 Hs.59838 hypothelical protein FU 10808 

410252 AW821182 Hs.61418 mfcrofibrillar-assoclated protein 1 

410259 AK000337 Hs.61485 hypothetical protein 

410276 AI554545 Hs.68301 ESTs 

20 410278 AW614396 Hs.282230 ESTs 

410325 AB023154 Hs.62264 KIAA0937 protein 

410356 BE244668 Hs.62543 dual adaptor of phosphotyrosine and 3-ph 

410388 AA831460 Hs.22039 hepatocyte nuclear factor 3, alpha 

410399 BE068889 Hs.63236 synuclein, gamma (breast cancer-speriJic 

25 410420 AA224053 Hs.172405 ESTs, Moderately similar to I52835 H-NUC 

410429 AA3106O0 Hs.63657 hypothetical protein FU 11 005 

410442 X73424 Hs.63788 propiorryl Coenzyme A carboxylase, beta p 

410475 AW749927 gb:QV&BT0537-231299-Q49-f03BT0537Hcmo 

_ _ 410495 N95428 gb:zb80d09.s1 Soares.senescenUibroblas 

30 410501 AI675688 Hs.83286 ESTs 

410503 AW975746 Hs.188662 Homo sapiens cDNA: FU 23421 fis, clone H 

410520 AW752710 gb:IL3-CT02t9-28l099-024-A03 CT0219 Homo 

410534 AW905138 gb:QW^N1071-28O400-2O7-gO7NN1071 Homo 

410537 AW753108 gb^M1-CT0247-08010(M)OB^10 CT0247 Homo 

35 410553 AW016824 Hs.68784 ESTs 

410560 N29220 gfcyx43b05.r1 Soares melanocyte 2NbHM Ho 

410561 BE540255 Hs.6994 Homo sapiens cDNA: FU 22044 fis, clone H 

410562 AW858528 gb:CM3-CT034 1-1 50300- 119-h 11 CT0341 Homo 
An 410579 AK001628 Hs.64691 KIAA0483 protein 

40 410634 AW888653 Hs.266859 ESTs 

410664 NM.006033 Hs.65370 lipase, endoihefiai 

410668 BE379794 Hs.65403 hypothetic^ protein 

410730 AW368B60 Hs.293950 ESTs 

410751 AA357918 gb£ST66726 Fetal lung Ul Homo sapiens 

45 410754 T63840 gteyc16b10.s1 Stratagene lung (937210) H 

410762 AF226053 Hs.66170 HSKM-B protein 

410764 AW978159 Hs.250164 ESTs, Weakly similar to coded for by C 

410782 AW504860 Hs.288836 Homo sapiens cOMA RJ 1 2673 fis, done NT 

_ 410794 AA248010 Hs.154669 ESTs 

50 410804 U64820 Ks.66521 Machado-Joseph disease (spinocerebeDar 

410844 AW807073 gb:MR4-ST0062-O3119M18406STO062 Homo 

410855 X9779S Hs.66718 RAD54 (S-cerevisiaeHta 

410910 AW810204 gb:MR«T0125O2119W)17-d08ST0 125 Homo 

410973 AW812278 gb:RC0-ST0174-21 109941 1-H12ST0174 Homo 

55 410976 R36207 Hs.25092 ESTs 

410997 AW812877 gb:RC^ST01 86-3001 00-01 7-eO4ST01 86 Homo 

410998 W28247 Hs.82007 WAA0094 protein 
411036 AA857218 Hs.297007 ESTs 

rr . 41 1 1 1 0 H93000 . gb:yv07fl)1 .si Soares fetal Bver spleen 

60 411132 AW819191 gb:CM1,ST0283-071299461-d08 ST0283 Homo 

411137 AW819455 gb:RC5-ST0293421 299-031 -AQ4 ST0293 Homo 

411157 AW819867 gb:QVr>ST0294-070300-151-f02ST0294 Homo 

411159 AW820178 gb:QV0-ST0294- 100400-1 85^07 ST0294 Homo 

, e 411170 AW820503 gb^V2-ST0298-14O200-042-b05 ST0298 Homo 

65 411193 AW821484 gb:IL2-ST0311-211299-028-F12ST0311 Homo 

411242 BE146808 gb:QV4-HT0222-181 099-01 3^03 HT0222 Homo 

411245 AW833441 gb:OV4-TT0008-27 1 099-02Otf) 1 TT0008 Homo 

411263 BE297802 Hs.69360 kinesirhfoe 6 (mitotic centromBre-assoc 

„ 411282 AW995011 gb:QWBNTJ04<M 7030^161-^^ 

70 411284 N28519 Hs. 135191 ESTs, Weakly similar to unnamed protein 

411294 AW859729 Hs.42680 ESTs 

411327 AW836922 gb:GVUT0036-150200-074-r)D6 LT0036 Homo 

411338 AW731782 Hs.116122 ESTs, Weakly similar to unnamed protein 

411339 BE164598 gb:RC34fT0470-1 2020001 3*1 0 HT0470 Homo 
75 411383 AA001394 Hs.69749 KIAA0087 gene product 

411387 AW842339 Hs.130815 hypolheBcaJ protein FU21 870 

411400 AA311919 Hs.69851 GAR1 protein 
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interferon, gamma-indudbte protein 16 


10.45 


0.05 


upregulate stage 


425304 


AA463844 


Hs.31339 


fibroblast growth factor 1 1 


1.57 


0.51 


upregulate stage 


425316 


AA354977 


Hs.191565 


ESTs. Moderately similar to NSD1 protein 


5.05 


0.12 


upregulate stage 
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425322 


U63630 


Hs.155637 


protein kinase, DNA-activated, catalytic 


14.7 


0.05 


upregulate stage 


425362 


AA355936 




gb:EST64410 JuricatT-ceflsVl Homosapis 


1 


1 


upregulate stage 


425397 


J04088 


Hs.156346 


lopoisomerase (DNA) D alpha (170kO) 


8.24 


0.09 


upregulate stage 


425403 


AL023753 


Hs.156406 


Human DNA sequence from done 1198H6 on 


1 


0.22 


upregulate stage 


425415 


M13903 


Hs.157091 


invotucrin 


1.19 


0.55 


upregulate stage 


425420 


BE536911 


Hs.234545 


ESTs, Weakly simitar to AF1 55135 1 novel 


2.85 


0.13 


upregulate stage 


425463 


AK000740 


Hs.157986 


hypothetical protein FU20733 


9 


0.07 


upregulate stage 


425465 


L18964 


Hs.1904 


protein kinase C, iota 


9.6 


0.07 


upregulate stage 


425467 


R16484 


Hs.190075 


ESTs 


1 


0.83 


upregulate stage 


425492 


AL021918 


Hs.158174 


zinc finger protein 184 (KruppeUike) 


3.2 


0.15 


upregulate stage 


425607 


U09860 


Hs.158333 


protease, serine, 7 (enterckinass) 


1 


1 


upregulate stage 


425608 


AA360486 


Hs.92448 


ESTs 


4.7 


0.14 


upregulate stage 


425614 


AI334963 


Hs. 156256 


ESTs 


2.65 


0.14 


upregulate stage 


425641 


079758 


Hs.14355 


Homo sapiens cDNA FU13207 fis, clone NT 


4.86 


0.1 


upregulate stage 


425660 


M521184 


Hs.105504 


ESTs 


1 


0.31 


upregulate stage 


425665 


AK001050 


Hs.159066 


hypotheGca) protein FU10188 


1.25 


0.19 


upregulate stage 


425672 


AA361483 




gb:EST70790 T-cell tyrnphoma Homo sapiens 


1 


1 


upregulate stage 


425707 


API 15402 


Hs.11713 


E74-iike factor 5 (sts domain transcript 


3.96 


0.13 


upregulate stage 


425726 


AF085808 


Hs.159330 


uroplakin 3 


0.92 


0.79 


upregulate stage 


425742 


AJ001454 


Hs.159425 


lestican 3 


1 


1 


upregulate stage 


425785 


T27017 


Hs.159528 


Homo sapiens clone 24400 mRNA sequence 


1 


0.39 


upregulate stage 


425811 


AL039104 


Hs.159557 


karyopherin alpha 2 (RAG cohort 1, impor 


1.B9 


0.44 


upregulate stage 


425843 


BE313280 


Hs.159627 


death associated protein 3 


3.1 


0.15 


upregulate stage 


425852 


AK001504 


Hs.159651 


death receptor 6 


1.72 


0.47 


upregulate stage 


425883 


AL137708 


Hs.161031 


Homo sapiens mRNA; cONA DKFZp434K0322 (f 


0.95 


0.68 


upregulate stage 


426010 


AA136563 


Hs.1975 


Homo sapiens cDNA: RJ21007 (is, clone C 


1 


0.34 


upregulate stage 


426028 


NMJJ01110 


Hs.172028 


a disinlegrin and metaiioproteinase doma 


14.3 


0.04 


upregulate stage 


426101 


AL049987 


Hs.166361 


Homo sapiens mRNA; cONA OKFZp564F1 1 2 (fr 


11.75 


0.05 


upregulate stage 


426108 


AA622037 


Hs.166468 


programmed eel! death 5 


3.23 


0.18 


upregulate stage 


426115 


H08895 


Hs.166733 


teucyUcystinyi aminopeptidase 


1 


0.32 


upregulate stage 


426168 


NMJJ03152 


Hs.167503 


signal transducer and activator of trans 


1.97 


0.4 


upregulate stage 


426257 


AL137201 


Hs.168625 


KIAA0979 protein 


1 


0.29 


upregulate stage 


426261 


AW242243 


Hs.168670 


peroxisomal famesylaied protein 


2.8 


0.16 


upregulate stage 


426283 


NM003937 


Hs.169139 


kynureninase (L-kynurenine hydrolase) 


14.75 


0.04 


upregulate stage 


426451 


A1908165 


Hs.169946 


GATA-binding protein 3 


3.05 


0.28 


upregulate stage 


426462 


U59111 


Hs.1 69993 


dermatan sulphate proteoglycan 3 


1 


0.36 


upregulate stage 


426490 


NM_001621 


Hs.1 70087 


aryi hydrocarbon receptor 


14.17 


0.05 


upregulate stage 


426514 


BE616633 


Hs.301122 


bone morphogenetic protein 7 (osteogenic 


1.15 


0.47 


upregulate stage 


426561 


AA381437 




gb:EST945 1 4 Activated T-cells 1 Homo sap 


5.65 


0.11 


upregulate stage 


426711 


AA383471 


Hs.180669 


conserved gene amplified in osteosarcoma 


11.05 


0,05 


upregulate stage 


426731 


AW303411 


Hs.130332 


ESTs 


2.4 


0.21 


upregulate stage 


426759 


AI550401 


Hs.21213 


ESTs 


9.5 


0.06 


upregulate stage 


426786 


AA319798 


Hs.172247 


eukaryouc translation elongation factor 


9.25 


0.09 


upregulate stage 


426788 


U66615 


Hs.172280 


SWl/SNF related, matrix associated, acG 


5.63 


0.14 


upregulate stage 


426818 


AA554827 


Hs. 124841 


ESTs, Weakly similar to ALU5.HUMAN ALU S 


9 


0.08 


upregulate stage 


426824 


087717 


Hs.1 72652 


KIAAO0 1 3 gene product 


1 


0.87 


upregulate stage 


426827 


AW067805 


Hs.172665 


nrathylenetetrahydrofoiate dehydrogenase 


8.95 


0.09 


upregulate stage 


426921 


AA037145 


Hs.172865 


cleavage stimulation factor, 7 pre-RNA, 


1 


0.19 


upregulate stage 


426935 


NM_000088 


Hs.172928 


collagen, type 1, alpha 1 


1.15 


0.72 


upregulate stage 


426997 


BE620738 


Hs.1 731 25 


peptidyi prolyl isomerase F (cyclophStn 


11.06 


0.06 


upregulate stage 


427071 


AA397958 


Hs.192719 


ESTs 


5.75 


0.08 


upregulate stage 


427126 


AA620613 


Hs.191827 


ESTs 


2.55 


0.18 


upregulate stage 


427134 


AA398409 


Hs.173561 


EST 


3.4 


0.18 


upregulate stage 


427142 


AA398510 


Hs.133148 


ESTs 


1 


0.25 


upregulate stage 


427259 


AA400096 




gb:zu69f07.s1 Scares testis NHT Homo sap 


1 


0.22 


upregulate stage 


427308 


D26067 


Hs.174905 


WAA0033 protein 


5.9 


0.1 


upregulate stage 


427315 


AA179949 


Hs.175563 


Homo sapiens mRNA; cDNA DKFZp564N0763 (f 


6.12 


0.11 


upregulate stage 


427356 


AW023482 


Hs.97849 


ESTs 


Z7 


0.13 


upregulate stage 


427370 


AI243615 


Hs.97740 


ESTs 


3.6 


0.14 


upregulate stage 


427376 


AA401533 


Hs.19440 


ESTs 


2.1 


0.16 


upregulate stage 


427387 


BE244966 


Hs.177584 


3-oxoarid CoA transferase 


1 


0.39 


upregulate stage 


427470 


AW999924 


Hs.178357 


Homo sapiens cDNA FU 13657 fis, clone PL 


3.4 


0.16 


upregulate stage 


427519 


AW085233 


Hs.1 80696 


ESTs 


8.23 


0.1 


upregulate stage 


427521 


AW973352 


Hs.299056 


ESTs 


7.75 


0.1 


upregulate stage 


427528 


Al/077143 


Hs.179565 


rnrnichrornosome maintenance deficient (S. 


5.7 


0.15 


upregulate stage 


427566 


AI743515 




gb:wf72b08.x2 Soares_NFl_T_GBC_S1 Homo s 


1 


1 


upregulate stage 


427581 


NM.014788 


Hs.179703 


KIAA0129gene product 


11.45 


0.06 


upregulate stage 


427585 


031152 


Hs.179729 


collagen, type X, alpha 1 (Schmid metaph 


3.25 


0.16 


upregulate stage 


427603 


A1090838 


Hs.98006 


ESTs 


1 


1 


upregulate stage 


427646 


AI578042 


HsJ71953 


ESTs 


5.7 


0.11 


upregulate stage 


427652 


AI673025 


Hs.43874 


ESTs 


1 


0.34 


upregulate stage 


427742 


AA411880 


Hs.190888 


ESTs 


Z4 


0,16 


upregulate stage 


427814 


W28383 


Hs.180900 


WiDiams-Beuren syndrome chromosome regi 


9.13 


0.08 


upregulate stage 


427839 


AA608823 


Hs.98244 


ESTs 


1.9 


0.19 


upregulate stage 


427878 


C05766 


Hs.181022 


CGI-07 protein 


4.1 


0.14 


upregulate stage 


427922 


AK001934 


Hs.181112 


HSPC126 protein 


2.6 


0.19 


upregulate stage 


427933 


AW974643 


Hs.190571 


ESTs 


4.55 


0.14 


upregulate stage 
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427934 


AA810541 


Hs.291856 


ESTs 


1 


1 


upregulate stage 


427944 


AA417878 


Hs.48401 


ESTs, Weakly similar to ALU8 JWMAN ALU S 


6.15 


0.1 


upregutate stage 


427961 


AW293165 


Ks.143134 


ESTs 


4.85 


0.11 


upregulate stage 


4279B6 


N45214 


Hs.282387 


Homo sapiens cDNA FU21837 fis, clone H 


3,55 


0.13 


upregulate stage 


428003 


AL1 10200 


Hs.1 81384 


Homo sapiens mRNA; cDNA DKFZp586B0922 (f 


1.45 


0.36 


upregulate stage 


428004 


AA449563 


Hs.300270 


ESTs 


3.95 


0.12 


upregutate stage 


428010 


AA806554 


Hs.185375 


ESTs 


1 


0.38 


upregulate stage 


428057 


AJ343641 


Hs.185798 


ESTs 


10.1 


0.06 


upregulate stage 


428053 


AI821625 


Hs.1 91602 


ESTs 


1 


0.5 


upregulate stage 


428071 


AF2 12848 


Hs 182339 


ets homologous factor 


6.4 


0.09 


upregulate stage 


428182 


BE386042 

O UvvU v"t 


Hs 293317 


ESTs, Weakly similar to JM27 {H.saptensJ 


1 


0.23 


upregulate stage 


428192 


M424051 




gb:zv80dQ3.s1 Soares_iotaJ_fetus_Nb2H F8_ 


2.45 


0.16 


upregulate stage 


428227 


AA321649 


Hs.2248 


small inducible cytokine subfamily B (Cy 


9.25 


0,04 


upregulate stage 


428403 


A1393048 


Hs 239894 


tptlninn rirh rpnnat fin Fl \\\ intersclin 


9.94 


0.06 


upregulate stage 


428436 


BE0B0180 




gb:RC4-BT0629-1 20200-01 1-51 0 BT0629 Homo 


1 


1 


upregulate stage 


428450 


mm 014791 


Hs 184339 


KIAA01 75 oenp nroduct 


4.43 


0.16 


upregulate stage 


428479 


Y00272 


Hs.184572 


cell division cyde 2, G1 to S and G2 to 


9.2 


0.07 


upregulate stage 


428529 


AW2B2022 


Hs.106278 


Homo sapiens cDNA FL) 12839 fis, clone NT 


1 


1 


upregulate stage 


428576 


AW009330 


Hs.167621 


ESTs 


1 


0.3 


Upregulate stage 


428605 


AB037862 


Hs.1 86756 


KIAA1441 protein 


9.25 


0.09 


upregulate stage 


428664 


AK001666 


Hs.189095 


similar to SAU.1 (sal (DrosophilaHike 


3.8 


0.08 


upregulate stage 


428685 


AF131853 


Hs.1 89527 


Homo sapiens done 2501 6 mRNA sequence 


1 


1 


upregulate stage 


428716 


AL122118 


Hs.190614 


Homo sapiens mRNA; cDNA DKFZp43401221 (f 


1 


0.65 


upregulate stage 


428783 


AW070204 


Hs.178176 


ESTs 


1.6 


0.23 


upregulate stage 


428788 


Af 082283 


Hs.193516 


B-cell CLL/tymphoma 10 


9.6 


0.08 


upregulate stage 


428829 


R 14050 


Hs.194051 


Homo sapiens mRNA; cDNA DKF2p566B213 (fr 


5.45 


0.11 


upregulate stage 


428839 


AJ767756 


Hs.82302 


ESTs 


10 


0.06 


upregulate stage 


428881 


AI298368 


Hs.98918 


ESTs 


1.4 


0.18 


upregutate stage 


428954 


AF1 00781 


Hs.194678 


WHT1 Inducible signaling pathway protein 


1 


1 


upregulate stage 


428988 


AA442900 


Hs.27947 


ESTs 


3.05 


0.13 


upregutate stage 


429042 


AW0 15439 


Hs.235920 


ESTs 


1 


0.56 


upregulate stage 


429057 


AF1 56557 


Hs.194816 


stomaSn-tike protein 1 


0.95 


0.93 


upregulate stage 


429065 


AA868555 


Hs.178222 


ESTs 


6 


0.11 


upregulate stage 


429072 


AJ376228 


Hs. 106043 


Friend leukemia virus integration 1 


1 


1 


upregulate stage 


429083 


Y09397 


Hs.227817 


BCL2-felated protein A1 


11.12 


0.03 


upregulate stage 


429091 


AA935658 


Hs.1 87939 


ESTs 


8.9 


0.08 


upregulate stage 


429115 


AA446728 


Hs 289020 


Homo sapiens cDNA FU 14098 Ms, clone MA 


4.1 


0.17 


upregulate stage 


429127 


AA749382 


Hs.107233 


ESTs 


1 


0.23 


upregulate stage 


429135 


AA446966 


Hs.99090 


ESTs, Moderately similar to similar to K 


1 


1 


upregulate stage 


429170 


NM 001394 


Hs.2359 


dual soecificitv nhnsohatasp 4 


8.6 


0.08 


upregulate stage 


429174 


BE559598 


Hs 197803 


K1AA01 60 protein 


8.4 


0.08 


upregulate stage 


429236 


AA448407 




nh*7wfiRrf1 1 <?1 Snprp*; tpQffe MHT Hnmn «;An 
yu.ivruou 1 1 ^ 1 <ju«uco_icoua_iiri i nuuiu ocxp 


\ 


0*36 


unreatilafp. staas 

UpicyuKJlu 0 lay a 


429268 


AA205386 


Hs 198481 


RAR-reJaled orphan receptor B 


2.9 


0.16 


upregulate stage 


429300 


AB011108 


Hs.198891 


serine/threonirie-protein kinase PRP4 bom 


4.25 


0.15 


upregulate stage 


429334 


D63078 


Hs 188180 

no. 100 iou 


Hnmn cantons rCNA: FU23038 fis Hnrw L 


2.95 


0.11 


upregulate stage 


429344 


R94038 


l-k 1995.18 
no, isaiMO 


inhiMn hetaC 


2.91 


0.28 


upregulate stage 


429359 


W00482 


Hs.2399 


matrix metaltoproteinase 1 4 (rrierobrHne-in 


1.19 


0.68 


upregulate stage 


429376 




Hs.43227 


ESTs 


1 


1 


upregulate stage 


429412 




Hs.2407 


Pflll rinmain rias<i 9 accnfiafirkn farfnr 


8.15 


0.07 


upregulate stage 


429450 


AA824451 


Hs 94292 


Homo sapiens cDNA: FU 23311 fis, clone H 


3.3 


0.17 


upregulate stage 


429472 




Hs 15852 


ESTs 


1 


1 


upregulate stage 


429482 


AF076974 


Hs 203952 


transformation/transcription domaln-asso 


1.52 


0.59 


upregulate stage 


429486 


AF1 55877 




hvtWhpttml nmtoin R 110139 


2.9 


0.15 




429572 


AW295375 


Hs 39474 


ESTs 


1 


0.95 


upregulate stage 




Alfl177R5 
nJO 1 1 1 OH 


no. tOuUOi 


nmlpin Itfnasp ^AMP-dpnpnripnt rpjnilatn 


6.55 


0,1 


upregulate stage 


429590 




Hs.44445 


ESTs, Weakly similar to Ketch motif cont 




1 


upregulate stage 


429597 




Hs.2442 


a riKlntpnrin smA fnpfallonrnlpjnfKA rinma 


5.6 


0.14 


upregulate stage 


429601 


AI8Q4293 


Hs 119406 


ESTs, Weakly similar to AF1 43946 1 trans 


1*35 


0.58 


upregulate stage 


429602 


AA521463 


Hs 183424 


ESTs 


1 


0.34 


upregulate stage 


429617 


X89984 


Hs.211563 


B-cell CLL/lymphoma 7A 


10.8 


0.07 


upregulate stage 


429629 


BF501732 


Hs.30622 


Homo sapiens cDNA FU 13010 fis, clone NT 


3.4 


0.12 


upregulate stage 


429631 


AA455612 


Hs 136710 


EST 


1 


1 


upregulate stags 


429644 


AA455892 


Hs.156379 


ESTs 


3.4 


0.15 


upregulate stage 


429653 


NM_005955 


Hs.211581 


metal-regulatory transcription factor 1 


4.45 


0.17 


upregulate stage 


429664 


L20433 


Hs.211588 


POL) domain, class 4, transcription facto 


1.17 


0.74 


upregulate stage 


429673 


AA884407 


Hs.211595 


protein tyrosine phosphatase, norwecept 


3.19 


0.22 


upregulate stage 


429699 


AI383469 


Hs. 159300 


ESTs 


4.4 


0.1 


upregutate stage 


429782 


NM_005754 


Hs.220689 


Ras-GTPase-activating protein SH3-domatn 


4.25 


0.13 


upregulate stage 


429813 


AW139878 


Hs.180791 


ESTs 


1 


0.95 


upregulate stage 


429828 


AB019494 


Hs.225767 


1DN3 protein 


4.2 


0.14 


upregulate stage 


429838 


AW904907 


Hs.108241 


ESTs, Weakly similar to The K1AA0191 gen 


3.25 


0.14 


upregutate stage 


429859 


NM.007050 


Hs.225952 


protein tyrosine phosphatase, receptor t 


1 


0.36 


upregulate stage 


429913 


AA460608 


Hs.99552 


ESTs 


1.35 


0.2 


upregulate stage 


429917 


H80572 




gtoyu76c02.r1 Soares fetal liver spleen 


4.8 


0.13 


upregulate stage 


429921 


AA526911 


Hs.102756 


ESTs 


1 


0.63 


upregulate stage 


429950 


AW081608 


Hs.105053 


ESTs 


3.7 


0.13 


upregulate stage 


429971 


AF079550 


Hs.227098 


glial cells missing (Drosophita) homolog 


1 


0.B3 


upregulate stage 
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429979 


AA463338 




gb:zx97a10/1 Soares_NhHMPu_S1 Homosapi 


1 


0.32 


upregulate stage 


429982 


AW449534 


Hs.99607 


Homo sapiens cONA FU13B41 fis, clone TH 


9.45 


0,08 


upreguiate stage 


429986 


AF092047 


Hs.227277 


sine ocuiis homeobox (Drosophiia) homoto 


0.78 


0.58 


upregutate stage 


430020 


AI539029 


Hs.99607 


Homo sapiens cDNA FU 13841 fe. clone TH 


1 


0.36 


upregulate stage 


430021 


AA463913 


Hs.221160 


ESTs 


1 


0.56 


upregulate stage 


430049 


AW277085 


Hs.99619 


ESTs 


3.55 


0.17 


upregulate stage 


430060 


NM.002941 


Hs.301198 


roundaboul (axon guidance receptor, Dros 


1 


0.59 


upreguiate stage 


430076 


AA465115 




gb:aa32c11/1 Na_CGAP_GCB1 Homo sapiens 


5.4 


0.12 


upreguJale stage 


430134 


BE380149 


Hs.105223 


ESTs, Weakly similar to contains similar 


3.6 


0.13 


upregulate stage 


430184 


AB013802 


Hs.234790 


contacts 5 


1 


1 


upregulate stage 


430195 


AW969308 


Hs.188594 


ESTs 


9.15 


0.1 


upregulate stage 


430279 


R85974 


Hs.16279 


ESTs 


1.2 


0.52 


upreguiate stage 


430287 


AW1 82459 


Hs.125759 


ESTs, Weakly similar to tumor suppressor 


4.05 


0.15 


upregulate stage 


430291 


AV660345 


Hs.238126 


C6M9 protein 


7.2 


0.08 


upregulate stage 


430299 


W28673 


Hs.106747 


serine carboxy peptidase 1 precursor prat 


11.9 


0.06 


upregulate stage 


430350 


BE169639 




gb:PM14fr0527-28020u-005-a05 HT0527 Homo 


7.1 


0.09 


upregulate stage 


4303B7 


AW372884 


Hs.240770 


nuclear cap binding protein subunit 2, 2 


5.9 


0.11 


upregulate stage 


430486 


BE062109 


Hs.241551 


chloride channel, calcium activated, fam 


2.56 


0.2 


upregulate stage 


430488 


019589 


Hs.4220 


ESTs, Moderately similar b tetracycline 


10.5 


0.08 


upregutate stage 


430512 


AF1 82294 


Hs.241578 


U6 snRhlA-associaied Sm-Kke protein LSm8 


11.2 


0.06 


upregulate stage 


430519 


AF 129534 


Hs.49210 


F-boxonly protein 4 


5.35 


0.11 


upregutate stage 


430550 


AK000062 


Hs.243756 


hypothetical protein FU20055 


1 


1 


upregulate stage 


430561 


BE065227 




gb:RC1-BT0314-31030CM)15-b06 BT0314 Homo 


1 


1 


upreguiate stage 


430563 


AA481269 


HS.1783B1 


ESTs 


1 


0.45 


upreguiate stage 


430598 


AK001764 


Hs.247112 


hypothetical protein FU 10902 


4.75 


ai4 


upregulate stage 


430530 


AW269920 


Hs.2621 


cystatinA(stefinA) 


2.52 


0.25 


upregulate stage 


430634 


AJ860651 


Hs.26685 


ESTs 


1.24 


0.61 


upregulate stage 


430637 


BE 160081 


Hs.256290 


S100 catchim-bintfing protein A11 (caigiz 


1.79 


0.47 


upregulate stage 


430640 


AA482636 




gb:zv29c06 /1 Scares ovary tumor NcHOT H 


9.15 


0.08 


upregulate stage 


430665 


BE350122 


Hs.1 57367 


ESTs 


9.4 


0.08 . 


upregulate stage 


430726 


AL031224 
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a75 


0.1 


upregutatB stage 


435255 


W87434 


Hs.106015 


ESTs. Moderately similar to ALU1.HUMAN A 


3.4 


0.14 


upregulate stage 


435256 


AF193766 


Hs.13872 


cytokine-like protein CI 7 


3.2 


0.14 


upregulate stage 


435257 


AA677026 


Hs.191217 


ESTs 


4.5 


0.12 


upregulate stage 


435298 


AA677696 


Hs.189196 


ESTs 


1 


1 


upregulate stage 


435307 


W90610 


Hs.192003 


ESTs 


1 


0.87 


upregulate stage 


435347 


AW014873 


Hs.1 16963 


ESTs 


2.45 


0,14 . 


upregulate stage 


435382 


N54493 




gfcyv40g05.s1 Soares fetal liver spleen 


1 


0.56 


upregulate stage 


435408 


H07897 


Hs.4302 


ESTs 


0.84 


0.97 


upregulate stage 


435491 


T98543 


Hs.191900 


ESTs 


1 


0.3 


upreguiate stage 
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435525 


AI831297 


Hs.123310 


ESTs 


3.25 


0.15 


upregulate stage 


435597 


AW305188 


Hs. 163027 


ESTs 


1 


0.57 


upregulate stage 


435637 


AF220051 


Hs.1 10853 


uncharacterized hematopoietic stem/progs 


a76 


0.09 


upregulate stage 


435547 


AI653240 


Hs.49823 


ESTs 


3.19 


0,25 


upregulate stage 


435738 


AA699633 


Hs.269543 


ESTs 


2.9 


0.16 


upregulate stage 


435762 


AW043836 


Hs.212460 


ESTs 


1 


1 


upregulate stage 


435809 


H90213 


Hs. 191330 


ESTs 


1.25 


0.23 


upregulate stage 


435826 


Al 554089 


Hs.1 17880 


ESTs 


6.55 


0.08 


upregulate stage 


435854 


AJ278120 


Ks.4996 


OKFZP564D166 protein 


2.76 


0.29 


upregulate stage 


435979 


W03698 


Hs.83513 


ESTs, Weakly similar to ALU1_HUMAN ALU S 


1 


0.25 


upregulate stage 


435981 


H74319 


Hs.1 88620 


ESTs 


6.35 


0.11 


upregulate stage 


435990 


Al 01 5862 


Hs.1 31 793 


ESTs 


4.25 


0.08 


upregulate stage 


435999 


AA703271 


Hs.269903 


ESTs, Moderately similar to ALULHUMAN A 


1 


1 


upregulate stage 


436016 


AA806465 


Hs. 121536 


ESTs 


1.45 


0.23 


upregulate stage 


436023 


T81819 




gb:yd95f05.s1 Soares fetal liver spleen 


9.15 


0.07 


upregulate stage 


436052 


A1021983 


Hs.271432 


ESTs 


1 


0.23 


upregulate stage 


436115 


AW512033 


Hs.102004 


ESTs 


1.9 


0.21 


upregulate stage 


436118 


AI221 173 


Hs.145080 


ESTs 


1 


1 


upregulate stage 


436120 ' 


AI248193 


Hs.1 19860 


ESTs 


9.61 


0.08 


upregulate stags 


436149 


A1754308 


Hs.1 59452 


ESTs 


2.4 


0.19 


upregulate stage 


436156 


AA705466 


Hs.1 19900 


ESTs 


1 


0.26 


upregulate stage 


436170 


AW450381 


Hs.1 4529 


ESTs 


1 


0.91 


upregulate stage 


436202 


AA706315 


Hs.1 92057 


ESTs 


1 


1 


upregulate stage 


436246 


AW450963 


Hs.1 19991 


ESTs 


3.65 


0.11 


upregulate stage 


436274 


AA732755 


Hs.1 20299 


ESTs 


£85 


0.13 


upregulate stage 


436282 


R91913 


Hs.272104 


ESTs 


3.95 


0.11 


upregulate stage 


436299 


AK000767 


Hs.5111 


hypothetical protein FU 20729 


6 


0.09 


upregulate stage 


436300 


AA831601 


Hs.275736 


ESTs 


1 


1 


upregulate stage 


436326 


BE085236 


Hs.181244 


major histocompatibility complex, class 


2.49 


0.28 


upregulate stage 


436360 


AI962796 


Hs.136754 


ESTs 


2.4 


0.17 


upregulate stage 


436363 


AA843926 


Hs.1 24434 


ESTs 


3.35 


0.1 


upregulate stage 


436383 


BE065178 




gb:RC1-BT0314-020200-Q12-h01 BT0314 Homo 


1 


0.67 


upregulate stage 


436396 


A1683487 


Hs.299112 


Homo sapiens cONA FU1 1441 fis, clone HE 


4 


0.15 


upregulate stage 


436408 


AW274577 


Hs.252954 


ESTs 


9.1 


0.09 


upregulate stage 


436422 


AA716141 


Hs.147027 


ESTs 


1 


0.26 


upregulate stage 


436429 


AA357003 


Hs.1 7546 


hypolhetical protein FU 23499 


1 


1 


upregulate stage 


436463 


H08502 


Hs.6656 


ESTs 


5.45 


0.12 


upregulate stage 


436476 


AA326108 


Hs.53631 


ESTs, Weakry similar to enhartcer-of-spB 


7.75 


0.07 


upregulate stage 


436507 


AA721209 


Hs.201630 


ESTs 


2.45 


0.18 


upregulate stage 


436518 


AA766433 


Hs.122864 


ESTs 


3.15 


0.21 


upregulate stage 


436522 


AA721381 


Hs.1 29876 


ESTs 


4.75 


0.1 


upregulate stage 


436578 


A1091435 


Hs.134859 


ESTs 


3.4 


0.12 


upregulate stage 


436670 


Af 690021 


Hs.201536 


ESTs 


6.85 


0.11 


upregulate stage 


436740 


AW975133 




gb:EST387239 MAGE resequences, MAGN Homo 


7.4 


0.09 


upregulate stage 


436764 


AW976004 


Hs.291731 


ESTs 


1 


1 


upregulate stage 


436785 


AA745597 


Hs.291400 


ESTs 


1 


1 


upregulate stags 


436823 


AW749865 


Hs.293645 


ESTs 


4.6 


0.12 


upregulate stage 


436831 


AA830173 


Hs.291918 


ESTs 


1.6 


0.27 


upregulate stage 


436839 


AA767346 


Hs^91614 


ESTs 


1 


1 


upregulate stage 


436844 


AA766458 


Hs.1 22812 


ESTs 


1.5 


0.28 


upregulate stage 


436853 


BE328074 


Hs.148661 


ESTs 


5.05 


0.14 


upregulate stage 


436860 


H12751 


Hs.5327 


PROI 914 protein 


8.95 


0.08 


upregulate stage 


436925 


AA742327 


Hs.292687 


ESTs 


1 


1 


upregulate stage 


437044 


AL035864 


Hs.69517 


ESTs, Highly similar to differentially e 


1.61 


0.5 


upregulate stage 


437087 


AA745563 




gb:ny60e04.s1 NCI_CGAP_Pr18 Homo sapiens 


1 


1 


upregulate stags 


437144 


AL049466 


Hs.7859 


ESTs 


1 


0.31 


upregulate stage 


437170 


R49202 


Hs.181694 


ESTs 


8.7 


0.08 


upregulate stage 


437181 


AJ 30661 5 


Hs.1 25343 


ESTs, Weakly similar b K1AA0758 protein 


4.4 


0.05 


upregulate stage 


437207 


T27503 


Hs.1 5929 


Homo sapiens cDNA FU12910 lis, clone NT 


1.4 


0.22 


upregulate stage 


437214 


BE092336 




gb:lL2-BT0734-24040CW)72-A1 2 BT0734 Homo 


5.65 


0.09 


upregulate stage 


437240 


AA747537 




gb:nx85c05.s1 NCI CGAP GCB1 Homo sapiens 




0.45 


upregulate stags 


437257 


Al 283085 


Hs.290931 


ESTs, Weakly similar to unknown [S.cerev 


3.8 


0.14 


upregulate stage 


437258 


AUM1243 


Hs.174104 


ESTs 


9.72 


0.08 


upregulate stage 


437267 


AW511443 


Hs.258110 


ESTs 


4.25 


0.12 


upregulatB stage 


437274 


M747965 




gb:nx79al0.s1 NCI_CGAP_Ew1 Homo sapiens 


1 


0.19 


upregulate stage 


437288 


AA748182 


Hs.160377 


ESTs 


1 


0.61 


upregulate stage 


437311 


AA370041 


Hs.9456 


SW1/SNF related, matrix associated, acti 


3.1 


0.17 


upregulate stage 


437324 


AL162077 




gb:Homo sapiens mRNA; cDNA OKF2p761 A219 


1 


0.25 


upregulate stage 


437330 


AL353944 


Hs.50115 


Homo sapiens mRNA; cDNA DKFZp761J1112 (f 


7.95 


0.07 


upregulate stage 


437356 


BE622396 


Hs.284252 


Homo sapiens mRNA; cONA DKFZp76201615 (f 


1 


1 


upregulate stage 


437438 


AL359620 


Hs.14217 


hypothetical protein DKFZp762P2111 


9.15 


0.09 


upregulate stage 


437471 


AL390169 




gb:Homo sapiens mRNA; cDNA DKFZp547D064 


1 


1 


upregulate stage 


437553 


A1829935 


Hs.130497 


ESTs. Weakly similar to MAT8.HUMAN CHLOR 


1.1 


0.56 


upregulate stage 


437567 


AW627990 


Hs.288954 


Homo sapiens cONA: FU21466 fis, clone C 


1 


1 


upregulate stage 


437575 


AW954355 


Hs.36529 


ESTs 


10.25 


0.06 


upregulate stage 


437717 


AA804765 


Hs.132853 


ESTs 


1 


0.77 


upregulate stage 


437722 


AW292947 


Hs.122872 


ESTs 


9.75 


0.05 


upregulate stage 
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437752 


AA767376 


Hs.291631 


ESTs 


6.7 


0.OB 


upregulate stage 


437770 


AA767881 


Hs.1 22897 


ESTs 


2.57 


0.24 


upreguiate stage 


437798 


AW811767 




flb:RC2-ST016&-300999-01 1-g02 ST0165 Homo 


1 


1 


upregulate stage 


437799 


R510B3 


Hs.90787 


ESTs 


1 


1 


upregulate stage 


437852 


8E001836 


Hs.256897 


ESTs, Weakly similar to (1J36501 2. 1 [H.sa 


1.77 


0.3 


upregulate stage 


437886 


BE264111 


Hs.31314 


retlnobtastoma-blnding protein 7 


as 


0.08 


upregulate stage 


4378B7 


AAB11524 


Hs.29263 


Homo sapiens cOMA FLM 1896 Rs, clone HE 


3.8 


0.17 


upregulate stage 


4378B9 


AA830524 


Hs.1 24357 


ESTs 


1 


0.69 


upregulate stage 


437937 


AI917222 


Hs.1 21655 


ESTs 


1 


0.38 


upregulate stags 


437938 


A1950087 




gbwq05c02jc1 NCLCGAPJGd 1 2 Homo sapJen 


1.37 


0.52 


upregulate stage 


437983 


AI303023 


Hs.30211 


hypothetical protein FLJ22313 


8.82 


0.08 


upregulate stage 


438011 


BE466173 


Hs.145695 


splicing factor (CC 1.3} 


9 


0.09 


upregulate stage 


438032 


8E045624 


Hs.1 52992 


ESTs 


5.65 


0.13 


upregulate stage 


438069 


N80701 


Hs.33790 


ESTs 


3.25 


0.13 


upregulate stage 


438077 


AA777330 


Hs.50429 


ESTs 


1 


1 


upreguiate stage 


438081 


H49546 


Hs.298964 


ESTs 


a75 


0.11 


upregulate stage 


438102 


AA777793 


Hs.137580 


xytuloltinase (H. Influenzae) homolog 


1 


1 


upregulate stage 


438112 


W85729 


Hs.194279 


ESTs 


1 


0.33 


upregulate stage 


438113 


AI457906 


Hs.8882 


ESTs 


1.21 


0.55 


upregulate stage 


438119 


AW963217 


Hs.203961 


ESTs, Moderately similar to AF1 16721 89 


11.75 


0.07 


upregulate stage 


438144 


AA778894 


Hs.1 18364 


ESTs 


1 


1 


upregulate stage 


438153 


A1268632 


Hs.1 461 59 


ESTs 


1 


1 


upregulate stage 


438171 


AW976507 


Hs.293515 


ESTs 


2.82 


0.24 


upregulate stage 


438271 


121934 


Hs.14553 


sterol O-acyl transferase (acyl-Coenzyme 


1 


1 


upregulate stage 


438290 


AA843719 


Hs.1 22341 


ESTs 


a9 


0.13 


upregulate stage 


438321 


AA576635 


Hs.6153 


CGI-48 protein 


9.4 


0.08 


upregulate stage 


438325 


AA804258 


Hs.123229 


ESTs 


4.65 


0.11 


upregulate stage 


438334 


AA806992 


Hs.291686 


ESTs 


1 


1 


upregulate stage 


438365 


AA805760 




gb:ns43f01.s1 NQ CGAP GCB1 Homo sapiens 


1 


0.34 


upregulate stage 


438370 


AA843242 


Hs.48523 


ESTs 


4 


0.14 


upregulate stage 


438374 


AA321866 


Hs.6193 


Homo sapiens mRNA; cDNA DKFZp434C1717 (f 


1 


0.84 


upregulate stage 


438377 


AA806070 


Hs.291715 


ESTs 


1 


0.24 


upreguiate stage 


438378 


AW970529 


Hs.86434 


Homo sapiens cDNA: FU21816 fis. done H 


6.65 


0.11 


upregulate stage 


438401 


AL046321 


Hs.1 97484 


ESTs 


1 


1 


upregulate stage 


438403 


AA806607 


Hs.292206 


ESTs 


2.75 


0.14 


upreguiate stage 


438412 


AA80S776 


Hs.130814 


ESTs 


1 


1 


upregulate stage 


438448 


AA807344 


Hs.1 72932 


Homo sapiens mRNA for partial 3*UTR, seq 


1 


0.34 


upregulate stage 


438451 


AI081972 


Hs.220261 


ESTs 


5.7 


0.09 


upregulate stage 


438473 


H07986 


Hs.1 36901 


ESTs 


1 


1 


upregulate stage 


438487 


AI6B4733 


Hs.88820 


HDCMC28P protein 


1 


0.21 


upregulate stage 


438529 


AW007287 


Hs.28538 


Homo sapiens cDNA: RJ21086 fis, done C 


1 


0.36 


upregulate stage 


438534 


AW204052 


Hs.1 23644 


ESTs 


1 


0.37 


upregulate stage 


438693 


AA814360 


Hs.249595 


ESTs 


3.55 


0.15 


upregulate stage 


438698 


AW297855 


Hs.1 2581 5 


ESTs 


3.12 


0.22 


upregulate stage 


438728 


AA815202 


Hs 25657 


ESTs 


1 


0.67 


upregulate stage 


438746 


A1385815 


Hs.184727 


ESTs 


1.5 


0.35 


upregulate stage 


438805 


AA826048 


Hs 117887 

I IO- 111 Wt 


ESTs 


9.35 


0.07 


upregulate stage 


438812 


AA826199 


Hs.44287 


ESTs 


1 


0.57 


upregulate stage 


438817 


A1023799 


Hs 1S3242 

no* lUvt^L 


ESTs 


4.2 


0.08 


upregulate stage 


438885 


AA82772B 




ESTs Weakfv similar to AF1 49422 2 unkno 


4.05 


0.12 


upregulate stage 


438913 


A1380429 


Hs.172445 


ESTs 


5.4 


0.11 


upregulate stage 


438950 


H23789 


Hs.144530 


ESTs 


1 


1 


upregulate stage 


438961 


H42135 


Hs.101848 


ESTs 


7.85 


0.08 


upregulate stage 


438990 


AF0B5890 




gb:Homo sapiens fuQ length insert cDNA 


1 


0.83 


upregulate stage 


439026 


R98978 


Hs.117767 


ESTs 


1 


0.27 


upregulate stage 


439052 


AF085917 


Hs.37921 


ESTs 


1 


0.22 


upregulate stage 


439057 


H59623 


Hs.271561 


ESTs 


1 


1 


upregulate stage 


439176 


AI446444 


Hs.190394 


ESTs 


5.8 


0.12 


upreguiate stage 


439179 


AA831250 


Hs.292693 


ESTs 


1 


1 


upregulate stage 


439183 


AW970600 




gb:EST382681 MAGE resequences, MAGK Homo 


4.5 


0.13 


upregulate stage 


439208 


AK000299 


Hs.180952 


dynacfjn p62 subunrt 


11.9 


0.06 


upregulate stage 


439212 


AF087995 


Hs.1 34877 


ESTs 


11.4 


0.07 


upregulate stage 


439223 


AW238299 


Hs.23945 


ESTs 


2.79 


0.26 


upregulate stage 


439312 


AA833902 


Hs.270745 


ESTs 


8.9 


0.08 


upregulate stage 


439330 


AF086147 




gb:Homo sapiens full length insert cDNA 


1 


0.19 


upregulate stage 


439351 


W37688 


Hs.55158 


ESTs, WeaHy similar to weak similarity 


1 


0.31 


upregulate stage 


439430 


AF124250 


Hs.6564 


breast cancer anti-estrogen resistance 3 


3.9 


0.15 


upregulate stage 


439444 


AI277652 


Hs.54578 


ESTs 


11 


0.07 


upregulate stage 


439453 


BE264974 


Hs.6566 


thyroid hormone receptor interactor 13 


12.21 


0.05 


upregulate stage 


439476 


AA836340 


Hs.165490 


ESTs 


1 


0.65 


upregulate stage 


439492 


AF086310 


Hs.103159 


ESTs 


5.43 


0.1 


upregulate stage 


439527 


AW298119 


Hs.202536 


ESTs 


5.25 


0.1 


upregulate stage 


439550 


H10438 




gb:ym08d10.s1 Soares infant brain 1NIB H 


3.2 


0.18 


upregulate stage 


439560 


BE565647 


Hs.74899 


rrypotrtefcal protein FU 12820 


1.81 


0.32 


upregulate stage 


439565 


AF086386 


Hs.145599 


ESTs 


8.96 


0.07 


upregulate stage 


439592 


AF086413 


Hs.58399 


ESTs 


1 


1 


upregulate stage 


439605 


AF085431 


Hs.134805 


ESTs 


9.15 


0.09 


upregulate stage 
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439605 


W79123 


Hs.58561 


ESTs, Weakly similar to KK)1_HUMAN PROBA 


8.45 


0.05 


upregulatB stage 


439750 


AL359053 


Hs.57654 


Homo sapiens mRNA fufl tenglh insert cDN 


5.43 


0.14 


upieguiate stage 


439780 


AL1096B8 




gb:Homo sapiens mRNA Ml tenglh Insert 


5 


0.09 


upreQulate staje 


439851 


A) 149520 


Hs. 144453 


ESTs 


2.75 


0.21 


uprecjulale stape 
upregutate stage 


439862 


A1571647 


Hs.146170 


hypothetical i^otan FU22969 


6.3 


0.11 


439926 


AW014B75 


Hs. 137007 


ESTs 


33.5 


0.02 


upregulate stage 


439942 


AW993791 


Hs.94881 


ESTs 


9.9 


0.08 


upregulate stage 


439963 


AW247529 


Hs.6793 

1 JO* w 


plaletet-acSvaling facto/ acetylhydrola 


5.59 


0.15 


upregulate stage 


439979 


AW60D291 


Hs.6823 


hvoothetica) orotein FU10430 


6.95 


0.07 


upregulate stage 


43S$87 


AARRDUfi 


Hs 723935 


ESTs 


2.45 


0.17 


1 1 or?»ou! at b stao& 


43 oq no 


tv\ 1 190 1 1 


Hs.6838 


rae hnmnlnn npnp famrh/ mpmhpr P 
100 nuiiRJwy yoiiu toiiuiji ihciiiuci c 


8.75 


0.07 


tmrpfiulalR etarip 


440006 




Hs.6844 




1.68 


0.41 


nnrenidata sIpop 


440012 


AAflfiM79 




nh'flV19pflR ^1 Rname Ipetie NUT Hnmn can 




1 


iinrpnnlatn et^ftA 
upicyuioio oioyo 


440126 


AA375145 


Hs.66194 


ESTs 


1 


1 


Upregulate stage 


440194 


R43809 


Ue. 99Bfifl 


ESTs 




\ 


imrpfiulafo etaoA 


440228 


AF1 25392 


Hs.7089 


flieiiRn rrwfifppH nmfpfn 9 

UIOUUII UIUUUCU plUlDlit (. 


t 




(fnrpniilaip sfsno 

upi pyuioic 0 toy a 


440249 


nld£400w 


lie 1 5R1« 
no. 1 £3J£3 


ESTs 


1.74 


0.44 




440284 




no. IDiuOo 


ESTs 


•j 




upregulate stage 


440334 


RP97R119 
DCZfDI It 


Hs.7165 


£11IC llliyta piDccul £09 


9.45 


0.09 


ttnrwiiilatp etano 




MtVU 1 OOUc 


He 4709^ 


P«?Te 
CO lo 


4 
1 


0.33 


itnmntia}!) dstno 

upicgiHoio oujyy 


440351 


AFniflQ.11 


Re 717Q 
no. / 1 » j 


RAD1 f S nnmhpl hnmntrvi 


3.75 


0.16 


tinrpniilAfn etanft 
ujjicyuioLo oLtaya 


440366 


F08229 


Lie 19R971 


ESTs 


3.5 


0.12 




440462 


T71629 


Hs. 100554 


ESTs 


1.54 


0.52 


upregulate stage 


440527 


AVRfi7117 


Hs. 184164 


ESTs 


3.75 


0.14 


iforpnidaffl sfsaa 

up*cy ujtiu* oioyQ 


440613 


A] 7 33034 


Hs 137079 


ESTs 


3.9 


0.11 


upregulate stage 


440705 


AA904244 


Hs 153205 


ESTs 


3.9 


0.14 


upregulate stage 


440856 


AW993377 


Hs. 130390 


ESTs 


8.95 


0.09 


upregulatB stage 


440899 


AW449445 


Hs.1 72690 


diacytQlycerol kinase, alpha (B0KD) 


2.55 


0.2 


upregulate stage 


440917 


AA909651 


Hs. 160025 


ESTs 


1 


0.17 


upregulate stage 


440380 


AL042005 


Hs.1117 


tn'pepiidyi peptidase II 


8.9 


0.09 


upregulate stage 


440994 


AJ160011 


Hs. 193341 


ESTs 


t29 


0.58 


upregulate stage 


441092 


T99289 


Hs.126556 


EST 


4.9 


0.11 


upregulate stage 
upreQidate stage 


441107 


AA917075 


Hs.190520 


ESTs 


5.9 


0.09 


441131 
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upregulate stage 


450063 


Al 68 1509 


Hs.277133 


ESTs 


4.2 


0.17 


upregulate stage 


450083 


AA131795 


Hs.142001 


ESTs 


3.9 


0.16 


upregulate stage 


450116 


AA005355 


Hs.222882 


ESTs 


1 


1 


upregulate stage 


450121 


AUM0174 


Hs.288927 


Homo sapiens cDNA: FU 22944 fis, done K 


1 


1 


upregulate stage 


450135 


AI810816 


Hs.201142 


ESTs 


4.95 


0.14 


upregulate stage 


450144 


T63961 


Hs.301851 


ESTs 


Z75 


0.13 


upregulate stage 


450149 


AW969781 


Hs.293440 


ESTs, Moderately similar to ZIC2 protein 


a75 


0.14 


upregulate stage 


450151 


A1088196 


Hs.295233 


ESTs 


2.51 


0.28 


upregulate stage 


450152 


AI138635 


Ks.22968 


ESTs 


2.45 


0.15 


upregulate stage 


450195 


AA007352 


Hs.256042 


ESTs 


4.1 


0.14 


upregulate stage 


450221 


AA32B102 


Hs.24641 


cytosketeton associated protein 2 


1 


0.3 


upregulate stage 


450238 


T89593 


Hs.138777 


ESTs 


12^ 


0.07 


upregulate stage 


450257 


AW820313 




gb:QV2-ST0296-150200-028-d02ST0296 Homo 


1 


1 


upregulate stage 


450313 


A1038989 


Hs.24809 


hypothetical protein FU 10826 


4.35 


0.15 


upregulate stage 


450314 


AA574309 


Hs.283402 


TCReta 


10.1 


0.07 


upregulate stage 


450350 


T97817 


Hs.174880 ' 


ESTs 


3.65 


0.1 


upregulate stage 


450411 


D61167 


Hs.202156 


ESTs 


1 


0.67 


upregulate stage 


450447 


AF212223 


Hs.25010 


hypothetical protein P15-2 


10.75 


0.07 


upregulate stage 


450448 


D54299 


Hs.36244 


ESTs 


1 


1 


upregulate stage 


450449 


AI696596 


Hs.202068 


ESTs 


1 


1 


upregulate stage 


450506 


NM.004460 


Hs.418 


fibroblast activation protein, alpha 


11.45 


0.05 


upregulate stage 


450573 


AW964334 




gb:EST376407 MAGE resequences, MAGH Homo 


1.2 


0.2 


upregulate stage 


450628 
450636 


AW382884 


Hs.204715 


ESTs 


4.95 


0.13 


upregulate stage 


A!703076 


Hs^01959 


ESTs 


1 


0.69 


upregulate stage 
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450655 
450664 
450680 
450722 
450751 
450772 
450800 
450824 
450832 
450870 
450937 
450983 
451052 
451067 
451088 
451094 
451096 
451126 
451161 
451166 
451222 
451225 
451228 
451246 
451266 
451276 
451277 
451291 
451326 
451347 
451359 
451365 
451386 
451440 
4514B7 
451492 
451495 
451535 
451553 
451562 
451580 
451592 
451651 
451658 
451684 
451690 
451724 
451743 
451794 
451844 
451903 
451914 
451938 
451939 
451971 
451998 
452028 
452036 
452099 
452122 
452163 
452179 
452198 
452206 
452234 
452240 
452247 
452250 
452256 
452266 
452277 
452281 
452291 
452323 
452331 
452345 
452367 



AI707846 

AA808358 

AF131784 

AI732318 

AJ733251 

BE326391 

BE395161 

R09055 

AW970502 

AA011471 

R49131 

AA305384 

AA281504 

BE172186 

AA015600 

A1949B25 

BE383234 

H3O600 

AA211329 

T98171 

AA0183B6 

AI433694 

AI767166 

AW189232 

AA016292 

AW294386 

AK001123 



AW296946 



H85334 

AI791783 

AB029006 

AA017599 

AA018072 

AA018119 

H86887 

AW970577 

AA018454 

H04150 

AW138195 

AI805416 

AI097337 

AW195351 

AF216751 

AW451469 

AI903765 

AW074266 

AA019799 

T61430 

W19817 

AI822115 

A1354355 

U80456 

AA021185 

AW594129 

AK001859 

NM_003966 

BE612992 

AF216833 

A1863140 

H16725 

AI097560 

AW340281 

AW084176 

A1591147 

AL137432 

BE618654 

AK000933 

AI767250 

ALO49013 

T93500 

AF015592 

AA805679 

AA598509 

AA293279 

U71207 



Hs.279860 


hypothetical protein FU 20030 


1 


1 


Hs.36830 


ESTs 


1 


0.34 


Hs.25318 


Homo sapiens dons 251 94 mRNA sequence 


9.51 


0.09 


Hs.101120 


ESTs 


1 


0.87 


Hs.126853 


ESTs, WaaWy similar to JU0033 hypothetl 


1 


1 


Hs.280146 


ESTs, WeaWy similar to JU0033 hypclheti 


1 


1 


Hs.243963 


ESTs, WeaWy similar to ALU5.HUMAN ALU S 


8.7 


0.08 


Hs.269204 


ESTs 


3.03 


0.22 


Hs.105421 


ESTs 


6.15 


0.08 




gbzK)1h08.r1 SoaresJeta1Jver„spteen_ 


1.15 


0.23 


Hs.26267 


A7F<tependant interferon response protei 


9.75 


0.08 


Hs.25740 


ER01 (S. cerevisiae}-IiXe 


3.32 


0.26 


Hs.24444 


ESTs, Moderately similar to ALUE_HUMAN ! 


9.25 


0.08 


Hs.180789 


S164 protein 


Z8 


0.21 


Hs.82415 


ESTs 


1 


0.32 


Hs.260395 


ESTs 


4.45 


0.14 


Hs.25925 


Homo sapiens clone 23860 mRNA sequence 


4.15 


0.14 


Hs.40910 


ESTs 


1 


1 


Hs.26006 


hypothetical protein FU10559 


2.15 


0.16 


Hs.185675 


ESTs 


9.26 


0.08 


Hs.64341 


ESTs 


1 


0.36 


Hs.293608 


ESTs 


9.19 


0.08 


Hs.207025 


ESTs 


1 


1 


Hs.39140 


cutaneous T-cell lymphoma tumor antigen 


7.35 


0.11 


Hs.290849 


ESTs 


1 


0.33 


Hs.236533 


ESTs, Highly similar to d)742C19.2 [H.sa 


1 


1 


Hs.26176 


hypothetical prolan FU10261 


11.7 


0.06 


Hs.6702 


ESTs 


1 


1 


Hs.300967 


ESTs 


10.55 


0.07 


Hs.101139 


ESTs 


1 


1 




gbys90e05.r1 Soares retina N2b5HR Homo 


2.7 


0.15 




gb:op20h10.y5 NCI_CGAP_Co12 Homo sapiens 


8.9 


0.09 


Hs.26334 


spastic paraplegia 4 (autosomal dominant 


2.45 


0.19 


Hs.293817 


ESTs 


1 


1 




gbze51g02.r1 Soares retina N2MHR Homo 


5.7 


0.1 


Hs.297824 


ESTs, Highly similar to QK1_HUMAN VOLTA 


1 


1 




gfcyt07aQ1.M Soares retina N2b5HR Homo 


4.25 


0.13 




gb:EST382658 MAGE resequences, MAGK Homo 


6.8 


0.12 


Hs.269211 


ESTs, WeaWy similar to B34087 hypotheti 


1 


1 


Hs.107708 


ESTs 


4.65 


0,11 


Hs.184326 


COC10 (cell division cycle 10, S. cerevi 


1 


0.42 


Hs.213897 


ESTs 


Z8 


0.17 


Hs.88977 


hypothetical protein dJ511E16.2 


1 


0.18 


Hs^50520 


ESTs 


9.55 


0.07 


Hs^6B13 


COA14 


3.7 


0.15 


Hs.209990 


ESTs 


10.66 


0.07 




gb:ULffT037-301298-102 BT037 Homosaplen 


8.85 


0.09 


Hs.23071 


ESTs 


117 


0.35 


Hs.111911 


ESTs 


1 


1 



1 0.95 

1 1 

1 0.26 

1 0.21 

1.76 0.41 

8.9 0.07 



gb:yc06a03.s1 Stratagene lung (937210) H 6.5 0.1 1 

Hs.261003 ESTs, Moderately similar to B34087 hypot 2.2 0.21 

Hs.270618 ESTs, WeaWy similar to KIAA0822 protein 11.67 0.07 

Hs.16697 ^wn-regulatoroftranscripUonl.TBP* 11.65 0.06 

Hs.2731 1 single-minded (Orosophfla) homotog 2 

Hs.226306 ESTs 

Hs.213666 ESTs 

Hs.27595 hypolhelicaJ protein FU 10997 

Hs.27621 sema domain, seven thrombosponcftn repeat 

Hs.27931 hypothetical protein FU10607 simitar to 

Hs.1710 ATP-bindirig cassette, sub-family B {MDR/ 1 0.47 

gb:tz43h12Jt1 NCl_CGAP_Brn52 Homo sapien 1 0.2 

Hs.27463 ESTs 3 0.13 

Hs.61210 ESTs 1 0.28 

Hs.33074 ESTs, Moderately similar to ALULHUMAN A 12.4 0.0J 

Hs.223296 ESTs 6.8 0.09 

Hs.61232 ESTs 3,75 0.07 

Hs.28564 hypothetical protein DKFZp761E1824 19 0.15 

Hs.28507 hypothetical protetnA-211C6.1 8.75 0.09 

Hs^8661 Homo sapiens cONA FU 10071 fis, done HE 5.2 0.09 

Hs.165240 ESTs 10.45 0.06 

Hs.28783 KIAA1223 protein a9 0.05 

Hs.28792 Homo sapiens cDNAFUl 1041 fis, clone PL 8.2 0.04 

Hs.28853 COC7(ceDdivisloflcycte7.S.cerevfei 15 0.13 

Hs.61271 ESTs 15 0.14 

Hs^9117 Ksapiens mRNA for pur alpha extended 3" 11.75 0.07 

Hs^9173 hypothetical protein FU 2051 5 1.08 0.73 

Hs.29279 eyes absent (Drosophila) homolog 2 10.15 0.07 
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452401 


NM.007115 


Hs.29352 


tumor necrosis factor, alpha-induced pro 


1 


0.17 


452404 


AW450675 


Hs.212709 


ESTs 


3.63 


0.2 


452430 


AF118083 


Hs.29494 


PROI 912 protein 


1 


0.41 


452436 


BE077546 


Hs.31447 


ESTs 


10 


0.07 


452457 


AW062499 




gb:MRO-CT0065-100899-001-d02 CT0065 Homo 


1 


0.13 


452461 


N78223 


Hs.108106 


transcription factor 


8.1 


0.06 


452518 


AA280722 


Hs.24758 


ESTs 


9.3 


0.08 


452519 


BE006701 




gb:RCO-BN0132-270300-021-a03 BN0132 Homo 


1 


0.19 


452524 


AW136499 


Hs.29796 


Homo sapiens mRNA; cDNA DKFZp434D1319 (f 


1 


0.45 


452531 


AA429462 


Hs.293946 


ESTs 


2.94 


022 


452547 


AA335295 


Hs.74120 


adipose specific 2 


1.51 


0.53 


452560 


BE077O84 




gb:RC5-BT0603-220200-013-C07 BT0503 Homo 


5.35 


0.11 


452571 


W31518 


Hs.34665 


ESTs 


2.55 


0.11 


452607 


A1160029 


Hs.61438 


ESTs 


4.75 


0.11 


452677 


BE167202 


Hs.212Q65 


ESTs 


1 


0.32 


452680 


AW138410 


Hs.45051 


ESTs 


1 


1 


452724 


R84810 


Hs.30464 


cydin E2 


1 


0.27 


452738 


AL133800 




gb:DKFZp761A0814 rl 761 (synonym: hamy2) 


3.45 


0.15 


452741 


BE392914 


Hs.30503 


Homo sapiens cDNA FU11344 fis, clone PL 


3.05 


0.16 


452747 


BE153855 


Hs.61460 


ESTs 


£54 


0.28 


452761 


BE244742 


Hs.30532 


CGI-77 protein 


3.85 


0.14 


452825 


AI921523 




gb:wo26d09.x1 NCI_CGAP_Gas4 Hcmo sapiens 


1 


1 


452831 


AW864089 


Hs.135145 


ESTs 


2.4 


0.19 


452846 


AA082160 


Hs.204295 


ESTs 


8.9 


0.08 


452850 


H23230 


Hs.22481 


ESTs 


4.75 


0.14 


452659 


A1300555 


Hs.268158 


Homo sapiens cDNA: FU23591 fe, clone L 


9.15 


0.08 


452862 


AW378065 


Hs.6687 


ESTs 


5.95 


0.07 


452899 


M96739 


Hs.30956 


Human NSCl-1 mRNA sequence 


1.04 


0.9 


452902 


AI926501 


Hs.249729 


ESTs 


6.8 


0.1 


452909 


NM_015368 


Hs.30985 


pannexinl 


5.6 


0.1 


452931 


AW190011 


Hs.1 58006 


hypothetical protein 


1 


0.53 


452934 


AA581322 


Hs.4213 


ESTs 


1.44 


0.55 


452956 


AW003578 


Hs.231872 


ESTs 


1 


0.22 


452974 


BE09Q803 


Hs.61506 


ESTs 


1.75 


0.18 


453011 


N62952 


Hs.46473 


ESTs 


1 


1 


453050 


AW136479 


Hs.224046 


ESTs 


1 


0.39 


453074 


AA031813 


Hs.271880 


ESTs 


1 


1 


453076 


AI978583 


Hs.232161 


ESTs 


175 


0.14 


453123 


AI953718 


Hs.221849 


ESTs 


&6 


0.11 


453134 


AA032211 


Hs.118493 


ESTs 


1.68 


0.42 


453135 


T07866 


Hs.31834 


Homo sapiens clone 25129 mRNA sequence 


1 


1 


453137 


AJ954733 


Hs.223640 


ESTs 


1 


0.51 


453144 


AW268807 


Hs.61646 


ESTs 


1 


0.26 


453153 


N53893 


Hs.24360 


ESTs 


5 


0.13 


453156 


BE463762 


Hs.223784 


ESTs 


2.8 


0.15 


453204 


R10799 


Hs.191990 


ESTs 


9.5 


0.05 


453228 


AW628325 


Hs.232327 


ESTs 


1 


1 


453274 


AA018511 


Hs.32769 


Homo sapiens mRNA fufl length insert cON 


1 


1 


453293 


AA382267 


Hs.1 0553 


ESTs 


a4 


0.09 


453321 


A1984381 


Hs.232521 


ESTs 


&7 


0.1 


453329 


T97205 


Hs.17998 


ESTs 


8.9 


0.08 


453389 


BE273S48 


Hs.32963 


cadherin 6, type 1 K-cadherin (fetal Id 


1 


0.18 


453437 


H10751 


Hs.79981 


Human clone 23560 mRNA sequence 


1 


0.83 


453450 


AW797627 


Hs.89474 


ADP-ribosylation factor 6 


7.09 


0.08 


453459 


BE047032 


Hs.257789 


ESTs 


2.35 


0.3 


453476 


A1640500 


Hs.24633 


SAM domain, SH3 domain and nuclear local 


2.75 


0.16 


453651 


AA971698 


Hs.159397 


x 010 protein 


8.95 


0.08 


453653 


AW505554 


Hs.300284 


ESTs 


4.6 


0.1 


453775 


NMJJ02915 


Hs.35120 


replication factor C (activator 1) 4 (37 


3.4 


0.1 


453776 


R15749 


Hs.31677 


ESTs 


1 


1 


453846 


AL157586 




gb:DKFZp761H0216_rl 761 (synonym: hamy2) 


1 


0.95 


453884 


AA355925 


Hs.36232 


K1AA0186 gene product 


10.25 


0.06 


453900 


AW003582 


Hs.226414 


ESTs, Weakly similar to ALU8.HUMAN ALU S 


4.75 


0.12 


453913 


AW004683 


Hs.233502 


ESTs 


3.65 


0.14 


453925 


AW021088 


Hs.181614 


ESTs 


3.7 


0.13 


453931 


AL121278 


Hs.25144 


ESTs 


3.45 


0.18 


453945 


NM.005171 


Hs.36908 


activating transcription factor 1 


6.35 


0.12 


454032 


W31790 


Hs.194293 


ESTs 


6.15 


0.07 


454049 


AW022885 




gb:df45e05.y1 Morton Fetal Cochlea Homo 


2.8 


0.15 


454069 


AW025160 


Hs.34161 


ESTs. Moderately simitar to ALU 1 .HUMAN A 


1 


0.32 


454099 


AW052974 




gb:IL1-ST0041-02089WX)1-H08 ST0041 Homo 


1 


1 


454111 


AW081681 


Hs.269064 


ESTs 


2.8 


0.18 


454219 


X75042 


Hs.44313 


v-rel avian relicuIoendotheBosis viral 


9.4 


0.05 


454259 


AL110136 


Hs.47679 


Homo sapiens mRNA; cDNA DKF2p564l112 (fr 


6.2 


0.11 


454327 


BE064097 




gb:QV3^T0297-231 199420-hQ8 BT0297 Homo 


1 


1 


454331 


AW372937 




gb:Qv^BT0381-161299-042-a09BTtB81 Homo 


1 


0.43 


454380 


AW858722 




gb:RC3-CT0347-281 1994)1 1-c04 CT0347 Homo 


1 


0.29 
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454524 


AW857191 




9b:RC2.CT0304-080100^11-b12CT0304 Homo 


10.55 


0.08 


upregulate stage 


454592 


AW810112 




gb:MR4-ST01 24-100400O06-e07 ST0124 Homo 


1 


0.37 


upregutate stage 


454648 


AW811960 




gbKC2-ST0168-24030(W)17-f09 ST0168 Homo 


1 


0.4 


upregulate stage 


454687 


AW814473 




gb:MR3-ST0203-010200-109-c11 ST0203 Homo 


1 


1 


upregulate stags 


454692 


AW813350 




gb:MR3-ST0192-1001(XH)24-gQ7 ST0192 Homo 


4.45 


0.14 


upregulate stage 


454702 


BE 145915 




gb:MRO-HT0208-221299-204-h08 HT0208 Homo 


9.65 


0.08 


upregulate stage 


454729 


AW817003 




gb:QVO^T0247-04010tW»1-f03 ST0247 Homo 


1 


0.8 


upregulate stage 


454789 


BE156314 




gb;QVO-hT0367-15020(Ml4-d02 HT0367 Homo 


1 


0.31 


upregulate stage 


454797 


BE161168 




gb:PMO-HT0425-170lOO-002-a10 HT0425 Homo 


4.1 


0.14 


upregulate stage 


454863 


AW835610 




gb:QV4-LT0016-09020U100-c02 LT0016 Homo 


1 


1 


upregulate stage 


454893 


AW837753 




gb:CM1-LT0042-31010GM12-g03 LT0042 Homo 


4.45 


0.18 


upregulate stage 


454898 


AW838125 




gb:QV2-LT0051-24030(W97-e12 LT0051 Homo 


1 


1 


upregulate stage 


454951 


AW847464 




gb:RC3-CT0208-27099W)21-h12 CT0208 Homo 


7.4 


0.1 


upregulate stage 


454956 


AW847725 




gb:IL3-CT021 3-1 80200-041-H10 CT0213 Homo 


1 


0.23 


upregulate stage 


455047 


AW859530 




gb:PM1-CT0243-071099-001-g06 CT0243 Homo 


4.25 


0.13 


upregulate stage 


455128 


AW8S1555 
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gb:RC2-CT0321 -1 10100-01 3-b05 CT0321 Homo 


2.9 


0.13 


upregulate stage 


455201 


AW947884 




gb:PM1-MT(W10-200300-001-g08 MT0010 Homo 


3.15 


0,16 


upregutate stage 


455207 


AW994394 




gb:RC3-BN0036-060400^014-h12BN0036 Homo 


1 


0.18 


upregulate stage 


455331 


AW897292 
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ESTs 


2.6 


0.2 


upregulate stage 


457982 


AW856093 


Hs.183617 


ESTs 


1 


0.25 


upregulate stage 


458080 


BE142728 




gb:MRO-HT0157-02129WKMKi08HT0157 Homo 


2.05 


0.27 


upregulate stage 


458340 


AJ457102 


Hs.121583 


Human glucose transporter pseudogene 


2.25 


0.18 


upregulate stage 


458440 


AI095468 


Hs.1 35254 


ESTs, Weakly similar to thrombospondin t 


235 


0.13 


upregulate stage 


458771 


AW295151 


Hs.163612 


ESTs 


1 


0.19 


upregulate stage 


459092 


AA722012 


Hs.255757 


ESTs, Weakly simitar to K1AA061 1 protein 


6.95 


0.1 


upregulate stage 



TABLE 1B 



Pkey: Unique Eos probeset identifier number 
CAT number. Gene cluster number 



Accession: 


Genbank ac 


session numbers 


Pkey 


CAT 


Accessions 


407774 


101538 1 


AA084958 AA214034 AA044587 AA130152 AA1301 16 D81 924 


407939 


1027688J 


W05608AW118352AW196215 


407980 


103087J 


AA046309 AI263500 AA046397 


408224 


1048369.1 


AW175997 AW176000 AW175999 AW175994 AW176004 AW175989 


408241 


1048867J 


AW176546 AW178965 BE141057 


408268 


104987.1 


AL138247 AA053529 BE173313 


408277 


1050131J 


AW177959 AW177960 AW178066 AW178022 


408306 


1050863 1 


BE141991 AW1 78855 BE141990 BE141993 BE141994 BE062405 BE062404 AW178850 


409281 


111502J 


AA069998 AA075659 AA07566O AA069828 


409408 


1128045J 


AW387837 AW387874 AW387826 AW387758 AW387864 AW809268 AW387754 


409566 


114012 1 


AA078899 AA078782 AA075788 


409760 


115373J 


AA302840 T93016 T92950 AA077551 


409794 


1154785J 


AW885691 BE165989 BE 185940 AW500322 H10282 


409810 


1155339 1 


AW500895AW937549 


409840 


1156071.1 


AW502122AW502125AW501663AW501720 


410071 


1174764J 


AW582568 AW81 8656 AW81 8547 AW81 8655 AW81 8637 AW818234 


410128 


117773 1 


AW904599 AA325920 AW964919 AA0817B6 T05150 


410269 


1189983 1 


AW613597 BE080235 BE080168 


410475 


1204788 1 


AW749927 AW749938 AW749986 AW749966R87124 AW860547 


410495 


1205626 J 


N9S428W24040AW751366 H81987 


410520 


1206965J 


AW752710BE180336BE1801B6 


410534 


1207247J 


AW905138 AW753008 R13818 Z43519 


410537 


1207336 f 


AW753108 AW852909 N36993 AW894327 AW903629 


410560 


1208311 1 


N29220 AW754225 AW754224 


410562 


1208415 1 


AW858528 AW754362 H74000 


410724 


1218202.1 


AW799269 AW799385 AW799501 AW799386 AW799270 AW799467 


410751 


121964J 


AA35791B AA089516 AA357917 


410754 


1219733 1 


T63840AW801569AW801568 


410785 


1221055.1 


AW803341 AW803265 AW803403 AW803466 AW803402 AW80341 3 AW803268 AW803396 AW803334 AW803355 


410844 


1223878.1 


AW807073AW807055AW807067AW807276AW807030AW807353AW845892AW807W 






AW807195AW807271 


410910 


1226517.1 


AW810204 AW810555 AW810196 AW810619 AW810507 


410973 


1228236.1 


AWB12278AW812286AW812274 


410997 


1228736.1 


AWB1 2877 AW81 2878 AW81 2952 AW81 2880 


411110 


1232598.1 


H93000 AW81 8022 AW818044 


411112 


1232692.1 


AW818158 AW865743 AW865702 AW865698 


411132 


1233580.1 


AW819191AW819252AW819183AW819175AW819177AW819186AW819180BE158470AW819242AW819269A^ 
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411137 
411157 
411159 
411162 
411170 
411173 
411193 
411242 



411245 
411282 



411327 
411336 
411339 
411407 
411418 
411425 
411461 
411474 
411526 
411560 
411568 
411691 
411727 
411741 
411771 
411787 
411788 
411826 
411640 
411860 
411880 
411932 
412134 
412178 
412296 
412303 
412305 
412312 
412327 
412354 
412357 
412359 
412367 

412389 
412547 
412559 
412671 
412744 
412838 
413119 
413141 
413266 
413294 
413342 
413402 



413485 
413508 
413618 
413707 
413743 
413782 
413792 
413804 
414091 
414203 
414220 
414276 



1233775 1 
1234099.1 
1234102.1 
1234161 1 
1234379 1 
123446.1 
1235254.1 
1236393.1 



1235412.1 
1237660.1 



1238874.1 

1239216.1 

1239387.1 

1244071.1 

124518.1 

1245503.1 

1246730.1 

1247047.2 

1248748.1 

1249443.1 

1249776.1 

1254166.1 

1255052.1 

1255976 1 

1257335.1 

1258789 1 

125879.1 

1260006.1 

1260676 1 

126168.1 

1263110.1 

12661 25 J 

1279147 1 

1281471.1 

1288043.1 

1288130.1 

1288141.1 

1288349.1 

1288935.1 

1290342.1 

129061 1J 

129085.1 

1291505.1 

1292588.1 

1305813.1 

1306011.1 

1319959.1 

1324622.1 

1331178.1 

1349658.1 

1350477.1 

1356260.1 

1359149.1 

136279.1 

1366932.1 



413445 1370833.1 



1372953 1 

1374273.1 

1380384.1 

1384137.1 

13S5884J 

1389068.1 

1389912.1 

1390710.1 

141621.1 

1425510.2 

1426940.1 

1432115 -1 



AWB19265 AW819268 AW819246 BE152602 AW819249 AW819251 AW819253 AW819194 
AWB19455AW819711 AW819553 
AW819867 AW819956 AW820159 

AW820178 AW8201B3 AW819870 AW819937 AW835709 AW820184 AW820157 AW819945 AW820096 
AW819944 AW8201 82 AWB201 68 AW81 9959 AW81 9953 
AW820503 AW820306 AW820429 BE174741 AW820244 BE174738 
R81571AW962014AA324277 

AWB21484 AW821461 AW821490 AW821525 AW821526 AW821519 

BE146808BEl46949AW833828BE147()02BE147066AW833422AW833475AVra33485AVV833414AW833406AW 

AW833835AW833421 BE 1 46805 AW833465 BE 1 46753 BE1 47004 AW833473 AW833573 BE14706B AW833466 AW833680 BE1 47063 BE147058 

BE146807 AWB33830 AW833694 AW833596 AW833601 

AW833441 AW833552 AW833700AW833610 AW833673 AW833675 

AW99501 1 AW88063O AW995662 AW8801 96 AW860455 AW995379 AW880834 AW938253 AW880580 AW938263 AW880584 AW860454 
AW880603 AW880026 AW938265 AW880592 AW852203 AW880588 AW880680 AW995392 AW880598 AW860603 AW880590 AW880175 
AW846534 AW880098 AW880108AWB6O453 AW938051 AW995191 AW880532AW880542AW938197 AW994929AW880635 AW83543B 
AW938057AW938054 

AW836922 AW876719 AW876688 AW836919 AW835997 AW836908 AWB36912 AW836993 
AW837675 AW837356 AW837581 AW837348 AW837681 
BE164598 AW837578 AW837582 
R00903 R20424AW844169 

BE241870 AI752584 Z19223 AA307961 AA1B1 165 AA093272 AW368570 

AW84601 2 AW846007 AW845996 AWB45975 

AW847937 AWB47672 AWB47809 AW847923 AW847808 AW84761 5 

AW848427 AW848890 AWB481 59 AW8481 1 8 AW848634 AW848285 AWB48086 AW848485 AWB48283 AW8481 62 

AW850327 AW850350 AW850348 AW850375 

AW8511B6AW996967BE143456 

BE144593 AW851632 AW968149 AW968144 W21059 

AW857199 BE066018 BE065982 BE066082 BE065949 BE066019 BE066081 BE065954 BE066084 

AW858443 AW858436 AW861923 AW858551 AW861872 AW858474 AWB61904 AW858398 

AW859650 BE071806 AW859655 

AW993247AW861464 

AW863568 BE161696 BE161629 BE161824 

AW897793R20054H23321 

AW947946 AW947986 AW947977 AW8651 87 AW865246 AW8551 82 AW865376 AW865392 AW8651 84 

AW866330 AW866410 AW866332 AWB6641 1 AW866327 AW866326 AW866335 AW856334 AW866331 AW866333 AW866328 AW866329 

T89420 N87510AA095932 

AW872477BE088101T05990 

AW876548 AW876577 AW876528 AWB76623 AW87651 9 AW876540 AW876569 AW876534 AW876563 AW876545 AW876574 

AW895560 AW895557 AW895314 AW895537 AW895287 AW895411 

AW898526AW89B525Z19700 

AW936233AW936272 

AW936336AW936339 

AW936369 AW936377 AW938355 AW936410 AW936460 

AW936686 AW936656 AW936608 AW936635 AW936652 AW936679 AW938726 AW936602 AW936733 

AW937355 AW937450 BE162340 BE162397 

AW939146AW939200BE161819 

AW939537AW939584 W92705 

AW837985 AW837938 AA101 955 AW837913 AW837935 

AW9459S4 AW946020 AW946034 AW946027 AW946041 AW946044 AW946033 AW946024 AW946021 AW946029 AW94601 5 AW94601 6 

AW946039 AW946045 AW946028 AW946036 

AW947655AW98402O 

W27161AW961828 

T31474 T31 123 Z45146 AW962040 

AW977734 H62048 

N31101 N46491 AW994084 

D61870D61866BE0O2674 

BE065941 BE066098 BE066070 BE065997 BE066003 
BE16S323 BE067045 

BE300352 BE299274 BE075351 BE297444 
BE144034 K72932BE080176 
AA128535 AW973587 AA553822 

T24065 8E092527 BE092528 BE092204 BE092271 BE092516 BE092202 BE092280 BE092065 BE091991 BE092284 BE092514 
BE091995 BE092063 BE092068 BE091994 BE092283 8E092447 

BE141022 BE141513 BE141532 BE 141488 8E141481 BE141526 BE141516 BE141471 8E141478 BE141476 BE141475 BE141021 

BE141533 8E141464 BE 141490 BE 141472 BE 141480 

N52628 BE143639 BE143706 

BE145354 BE145429 BE14541B 

BE154078 BE154006 BE153891 BE153523 BE153983 

BE158679BE158714 

BE161004 BE162497 BE161135 BE160999 BE162492 
BE546104 BE165710 BE382801 

BE166924 BE1 66921 BE166925 BE16691 5 BE166970 BE166968 

T64682BE168190 BE168256 

T83742AW672882AA135378 

BE262170 BE382553 BE261026 BE273627 

BE298094 BE267860 

BE297862 
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414366 


1438636J 


BE549143 BE390613 BE277344 


414833 


1496271.1 


T07114 BE543888 


414985 


151 121 5 J 


C17372 D63180 O78908 D63262 


415027 


1515978 1 


D31010 D30991 D31168D31166D31465 


415104 


1522649 J 


D6D076D60259 D61037 


415114 


15230351 


D60468 060950 061218 K98936 


415178 


1527354J 


D80503 D81739 N9S691 


415295 


1533528 1 


F05086 F05091 R17158 


415303 


153380 1 


R1 1813 AA164376 T56632 


415352 


1S34744_1 


F06565 Z43466 R1B417 F06477 F06476 F07098 


415364 


151500fl~i 


F06771 H04B95 Z42778 


415371 


1 9 09U0D. | 


R 15239 Z45189 F0683S 


415392 


1R1574R 1 

1909/fD.I 


7440R7 F07617 R145S5 
t/rtuuf ru/uii rwwjj 


415569 


1110011 1 
iiM77 1 I.I 


741010 F12170 H 10981 T857S1 


415572 


1CKXK1 1 
1 90979 1 _ 1 


F19904 F19711Tfifi979 TRR9fi1 


41560D 


1540373 1 


F1 9RR4 T7411 2 NRQ11 R 79fl01Q 


415626 


15407W1 
I9W/ 00_l 


741R47 F110RR 171111 


415635 


IHflR^I 1 
lO*»UOiKJ_l 


F111fifl R912R0 T7762R 


415750 


1 991 I 0_ 1 


AA1R7719 AWQ7S074 AW1R441R AW7A444R 

MA (0 1 1 \ 4 /4vVaJ0v44 ATTjO^tOO rtViOO*t*fnO 


41 5773 


1KR4447 1 
139'Wt/_l 


R91R51 H00549 


415822 


1CC7Q4 \ 


nqcnii nfi1?n? AA1R071R 


415948 


156264 1 


AA2R227R AIQB417S AA171R07 


416173 


1574071 i 


R597R9 R17313 W941Q9 R10R7R 


416343 


1 JO JCKKJ | 


H40911 W0077S K49308 WOO 776 


416395 


1CQ9777 i 


R04*i75 TOORRfi HS2980 


416624 


1R0AR0A 1 


H60044 T471S7 H7K91 T50292 


417304 


166556 1 




417396 


1K744RC i 

1 0/ M*tC9_l 


T0S0R7 TQR744 M7830R 

1 309UI 1 3U / tt 1 N / DvU ii 


417742 


1RQR7fl2~1 

1 U9Q£0£_ 1 


RR4719 744580 R12451 


417974 


171237 1 


AA91Q7RR TOS700 H94407 
n/uiuiQj I39iUU n3*»*TU/ 


418297 


1736343 1 


R019S4 T071*ifi R5871 1 


418454 


175699 1 


AA11 53nfl AA92139? RF»i38098 BE087173 


418498 


176296? 1 


T78248 T887S3 R93361 


418546 


176677 1 


AA224827 T597Q8 T59843 BE1 56903 


418573 


176907 1 


AA9951 Rfl AL 1 57508 Al 1 57509 AA2251 RQ 


418636 


177402-1 


AW7498S5 AA995Q05 AW750208 AW750206 


418712 


1784125 1 


749 1 81 T11 fi?1 T97478 


418948 


180808 T 


AI917007 AWRflfiOOfl W180.15 W38709 AA919fl35 AW0l1R043 


4100^0 

t 13U 93 


1R1R01R 1 


Tflfi?16 TRR117 TB331R 

IDUiLIU I QUI If IDiMIO 


419145 


182217 1 


M99638 AW973750 AA328271 H 90994 AA558020 AA234435 N59599 R94815 


419472 


185148 1 


AW078ni8 AAR04?n4 AA2434nn 

nVV3f wU9v /WJUtiwH /Vi**t«*HvU 


41 04 77 


1 fit; 179 1 


AAR9R97Q AA94149R AW071R14 AWQR7R/K AW071fifW 
r\r\OZOt/ 3 /v\**T9*itO M»»yf ID IS rtVTSO/ 0U9 ftifJ/ I QUO 


419716 


167390 1 


A4JVL?77n AA 94 0001 


41QRfi^ 


1RR24R 1 


AW0RRO4<; AA9VWR4 HQ 1711 AWQRR041 
ATiyooyio /WiOusoH ny i / oo Hvryooyt i 


/t10R07 

h iyou* 


100404_1 


FJ774D9 AA7R94R9 AA7410RR RAR704 


A1QR14 


1 RRIRfi 1 
10OOOO.1 


AA911 1 1Q AWQR74RS AA9 I i1?fW 
AM49I 1 Oy AVT30/409 rtMZO \£\JH 


<1 13304 


1 03 1 93_ I 


AA7RKQ4 AllR^fM^ AA9m00fi 

rvitu 1 9y*l MIO09W0 nAZJ 1330 


41QQQR 


ioy/oo_i 


AA9 ( t9RQ1 AW90R1fifi AAfiRRflRI 


i901 11 
44\M I I 


I3U/ 99. 1 


AAJ'iRfiW AA9R0011 AW0R7090 AA9R9RR4 

M/\£9909t rWt0v3 I 1 nVVyor 3£U rvV£utOOt 


490970 
44U4/U 


109170 1 


a AOCTQgn AWR1R4Rn AI41R0R1 AW*WW71 


4AID4J 


I yo/yo_ i 


W/R7711 A4270nn4 44770001 


44U340 


107791 1 

iy* i 40_i 


AA9R1CnQ U0O4R7 KI4Ci;i7 


y!919fln 


4UU0OO_l 


A&R1 1RH4 A AR 00404 AA9RR0fl7 AW077R94 


x9hir 

44IOOO 


90117R 1 

4U 10/ 0_1 


AA9R7441 AA4101R t » RF0R407R AI47A147 


491 1R1 
It 1 JO 1 


201903 1 


A A1R1 7<i7 AW0R177R AA7RRH1 7 
n/Uu I / 9c r\»f 309* / o rvuoou I • 


49141 R 


202288 1 


AAR0RR10 AA9010HR AAR1R974 AWP7RRDR 


4/ 1 / UO 


905719 1 


AW7«I4141 AWRW490 AWRRR47R AWRR1QRQ AWRR1Q9R AWflRR*?^ AWRRRR1R AWRRjU'tf) AWR^fl'vdS 
flVVi nVVOOO*rtU MiVOOO't/ 9 /AiVOO J3D3 MVVOO 13/9 r\» 100009** MW09O0 10 MVT090H9U MTVD909H9 


491 rii 

4Z 10 10 


207654 1 


RF04R9W AA1110R1 AA9QR410 
DCiHacOO MAO lOUOO MMt30HI3 


i* 99904 
4444U4 


213258M 


AAllQOm AWQR9Q4R AAlOfilftfl 
MMOOyU 1 0 MYItSbZSWO rvUUD IOO 


499^49 


215498 1 


AA10O979 AA10Q119 AW0R1R17 

Mrt9v3c/ £ rV*9U3«J It /\F¥9Q IDOI 


VI9949Q 
44444y 


21 64S9~1 


AA110A97 AW0fi99Qt; 744RRR H0RR41 
MMO 1 1/04/ Mr V304439 tWODO nUOO*r 1 


>!99<y\4 
4443W 


217160 1 


AA111407 AW0RR191 M91RR1 R700S0 
MMOt I1U/ M¥»3909t 1 rl£9909 r\/UU9U 


422588 


218192J 


AA312730 AW953285 W28250 W27318 


422760 


221034J 


BE409561 BE162756AW732798 


422834 


221906.1 


AA318334 AW961457 AA317752 


423100 


224988,1 


AA323114 AA321992BE161391 BE161392 


423121 


225175,1 


AWB64848 AA32221 3 AA322209 AW961624 


423368 


227560J 


AA364195 AA325029 AW962050 


423632 


230430,1 


AA328824 AW961410 AW958096 


423544 


230585,1 


AA329048 AW962484 T83628 


42381 B 


232362^1 


AA332439 AA332579 AA331470 AW962539 


423837 


232478,1 


AW937053 AW937056 AW937062 AW937059 AW937057 AA331 599 AW937061 AW937093 AW937094 AW937058 AW85 1 327 BE082872 






AW752885 AW937055 AW937054 


423912 


233241.1 


BE091233 BE080904 BE079414 BE075356 AA332482 


424353 


238398.1 


AA339M6AA339189AW966286 


424589 


241202,1 


AW854298 AA343691 H50917 H50907 H50938 


424951 


245273.1 


AW964082 AA348838 AA348839 
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424993 
425360 
4253S2 
425672 
426358 
426475 
426561 
426603 
427259 
427565 
427980 
428042 
428192 
428337 
428436 



429236 
429917 
429979 
430076 
430350 
430561 
430640 
430795 
431319 
431322 
431670 
431828 
432093 
432222 
432724 
433005 
433108 
433129 
433171 
433311 
433371 
433440 
433585 
433782 
434006 
434092 
434138 
434280 
435382 
436023 
436359 
436383 
438740 
436884 
437087 
437146 

437214 
437240 
437251 
437274 
437324 
437471 
437798 
437854 
437938 



245782.1 
250631J 
250655.1 
254716J 
265381.1 
267785.1 
269158J 
269825.1 
276594.1 
280401.1 
285225.1 
286292.1 
288021.1 
289967.1 
291472.1 



301580.1 

31049.1 

311517.1 

312952.1 

316401.1 

319932.1 

321094.1 

323756.1 

331474.1 

331543.1 

336353.1 

338201.1 

341283 1 

343347 1 

35330 1 

357346.1 

35396.1 

359479.1 

360292-1 

362908.1 

364430.1 

36629 -1 

370470.1 

37414 1 

37855.1 

379978.1 

380572.1 

382816.1 

405265.1 

413562.1 

41847J 

41888 1 

426095.1 

428731J 

432797.1 

43371.1 

434730.1 

435139.1 

435242.1 

435465.1 

43609.1 

43747.1 

443128.1 

44418.1 

44573.2 



438316 
438366 
438390 
438739 
438990 
439183 
439330 



454745.1 

456119 1 

45662.1 

464193.1 

46760.1 

46956.1 

47119 1 



F07625 R351 15 AW9531 15 F05102 H13351 AA349497 F11152 AA707254 

BE547704AA355909 

AA355936AI741379 

AA361483AI557239 

BE536836AA376153 

AL134728 AA379975 AA379440 AW963931 

AA38 1437 AA628833 AW407275 

AA382291 AA994657 

AA400096AA400167 

AI74351 5 AA40561 7 AW276706 

AA418305AI264351 

AA419529H97089 H95977 

AA424051 AW104616 

AA644508 AA4794S9 AA4261 74 

8E080180 AW827313 AW231970 AA995028 AA426584 AW872716 AW892508 AW854593 AA578441 AW975234 AA664937 AA984131 
AA528743 AA552874 AA564758 AW053245 AI267534 AW0701 90 AW893483 AA770330 AA90692B AA906562 AA758746 AA551 71 7 AW06331 1 
AA42953B 

AA448407 AA448404 AA44B424 AA448426 
H80572AL109691 H8O520 
AA463338 M86137 M85945 A1040226 
AA4651 15 AW967750 AW869525 
BE169639AA476976 

BE065227 BE065366 BE065182 BE065184 AA481239 
AA482636 R16335 R16343 R60706 R53647 
AW971398 AA507162 AA486494 
AA873350 T82429 T8242B 

AW970622 AA503009 AA502998 AA502989 AA502805 T92188 

AW971287 AA524976 AA513479 

AA572994 AA516249 AA702595 

H28383 AW972670 H28359 AA525808 
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1.000 


439482 


W70045 


Hs.58089 


ESTs 


0.118 


439606 


W79123 


Hs.58561 


G protein-coupled receptor 87 


0.095 


439706 


AW872527 


Hs.59761 


ESTs, Weakly similar to DAP 1 .HUMAN DEATH 


0.055 


452240 


AI591147 


Hs.61232 


ESTs 


0.221 


452316 


AA298484 


Hs.61265 


ESTs, Moderately similar to G786.HUMAN P 


2.595 


452747 


BE153855 


Hs.61460 


lg superfamUy receptor LNIR 


3.677 


434876 


AF160477 


Hs.61460 


tg superfamRy receptor LNIR 


7.587 


444783 


AK001468 


Hs.621B0 


anillin (DrosophDa Scraps homdog). act 


0.046 


438779 


KM_003787 


Ks.6414 


nucleolar protein 4 


0.030 


439453 


BE264974 


Hs.6566 


thyroid hormone receptor tnteractor 13 


1Z016 


440126 


AA975145 


Hs.66194 


ESTs ' 


0.008 


451291 


R39288 


Hs.6702 


ESTs 


0.012 


439963 


AW247529 


Hs.6793 


platelet- activating factor acetylhydrola 


1.653 


440006 


AK000517 


Hs.6844 


hypothetical protein FU2Q510 


8.628 


437044 


AL035864 


Hs.69517 


cDNA for differentially expressed C01 6 g 


140.908 


418107 


R41726 


Hs.7284 


ESTs 


0.146 


436326 


BE085236 


Hs.75313 


aldo-keto reductase family 1, member B1 


0.649 


433675 


AW977653 


Hs.75319 


ribonucleotide reductase M2 polypeptide 


0.237 


414416 


AW409985 


Hs.76084 


hypothetical protein MGC2721 


2.242 


414430 


A1346201 


Hs.76118 


ublquitin carboxyMerminal esterase LI 


0.202 


414682 


AL021154 


Hs.76884 


inhibitor of DNA binding 3, dominant neg 


2.318 


414807 


AI738616 


Hs.77348 


hydroxy prostaglandin dehydrogenase 15-(N 


1.622 


414907 


X90725 


Hs.77597 


pcrfo (Orosophia)-fike kinase 


246.564 


451575 


AA767622 


Hs.78893 


KIAA0244 protein 


1.000 


400303 


AA242758 


Hs.79136 


UV-1 protein, estrogen regulated 


4.552 


416498 


U33632 


Hs.79351 


potassium channel, subfamily K, member 1 


5.128 


458921 


AI682088 


Hs.79375 


tiotocarboxyiase synthetase {biotav[prop 


0.245 


409235 


AA188827 


Hs.7988 


ESTs, WeaWy similar to 138022 hypclhefi 


7.249 


440371 


BE268550 


Hs.80449 


Homo sapiens, clone 1MAGE:3535294, mRNA, 


0.792 


452732 


BE300078 


Hs.80449 


Homo sapiens, clone (MAG&3535294, mRNA, 


360.782 


417003 


AL03B170 


Hs.80756 


betalne-homocysteine methyl transferase 


60.590 


407584 


W25945 


Hs.8173 


hypolhetical protein FU10803 


9.988 


417312 


AW888411 


Hs.81915 


leukemia-associated phosphoprotein p18 ( 


402.705 


417389 


BE260964 


Hs.82045 


midkine (neurite growth- promoting factor 


10.806 


428839 


A1767758 


Hs.82302 


Homo sapiens cONA FU14814 fis, clone NT 


1.051 


438315 


R56795 


Hs.82419 


ESTs 


0.226 


417900 


BE250127 


Hs.82906 


CDC20 (cell division cycle 20, S. cerevi 


26.260 


417933 


X02308 


Hs.82962 


Ihymidylate synthetase 


221.090 


418067 


AI127958 


HS.83393 


cystafinE/M 


Z396 
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10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



438086 


AA33S519 


Hs.83623 


nuclear receptor subfamily 1 , group I, m 


1.000 


41 8205 


L21715 


Hs.83760 


troponin 1, skeletal, fast 


0.159 


413385 


M34455 


Hs.840 


indoleamine-pyrrde 2,3 dioxygenase 


0.490 


418322 


AA284166 


Hs.84113 


cycfin-dependent Kinase inhibitor 3 (COK 
interleukln 8 receptor, beta 


1527 


413529 


U11874 


Hs.846 


a077 


458027 


L49054 


Hs.85195 


myeloid leukemia factor 1 


0.008 


418543 


NM 005329 


Hs.85962 


hyaiuronan synthase 3 
hypothetical protein 


1.813 


418583 


AA604379 


Hs.86211 


125.769 


441801 


AW242799 


Hs.86366 


ESTs 


55.026 


414792 


BE314949 


Hs.87128 


hypothetical protein FU23309 


ai39 


407246 


S70348 


Hs.87149 


integrin, beta 3 (platelet glycoprotein 


0.020 


433417 


AA587773 


Hs.8859 


Homo sapiens, Similar to R1KEN cONA 5830 


313.141 


445080 


AA830811 


Hs.88808 


ESTs 


1.000 


453450 


AW797627 


Hs.89474 


ADP-ribosytation factor 6 


137.718 


419227 


BE537383 


Hs.89739 


cholinergic receptor, nicotinic, beta po 


0.006 


401464 


AF039241 


Hs.9028 


hlstona deacetylase 5 


6.846 


443162 


T49951 


Hs.9029 


DKFZP434G032 protein 


14.057 


431024 


AA7 13666 


Hs.90462 


Homo sapiens, clone IMAGE:4 132043, mRNA, 


3.507 


419559 


Y07828 


Hs.91096 


ring finQer protein 


0.025 


419741 


NM_007019 


Hs.93002 


ubiquith carrier protein E2-C 


684.577 


443426 


AF098158 


Hs.9329 


chromosome 20 open reading frame 1 


0.363 


424457 


AI249036 


Hs.94292 


hypothetical protein FU 23311 


175.667 


410348 


AW1 82663 


Hs.95469 


ESTs 


0.011 


439738 


BE246502 


Hs.9598 


soma domain, immunoglobulin domain ((g). 


428.231 


421478 


A1683243 


Hs.97258 


ESTs, Moderately similar to S29539 ribos 


0.005 


443767 


BE562136 


Hs.9736 


prole asome (prosoma, macropain} 26S subu 


1.168 


426902 


A1125334 


Hs.97408 


ESTs 


37.457 


444874 


A1218496 


Hs.97515 


BRGA1 -interacting protein 1; BRCA1-assoc 


0.067 


427356 


AW023482 


Hs.97849 


ESTs 


1.000 


430000 


AW205931 


Hs.99598 


hypothetical protein MGC5338 


0812 


419485 


AA489023 


Hs.99807 


ESTs, Weakly similar to unnamed protein 


0.342 


420783 


AI659838 


Hs.99923 


lectin, galactoside-blnding, soluble, 7 


19.785 


421934 


AA300625 




gb:EST13476 Testis tumor Homo sapiens cD 


103.769 


431322 


AW970622 




gb:EST382704 MAGE resequences, MAGK Homo 


0.258 


424040 


AA334400 




gb:EST3B610 Embryo, 9 week Homo sapiens 


0.011 


433108 


AB002446 




gbiKomo sapiens mRNA from chromosome 5q2 


0.023 


458829 


AI557388 




gb:PT2.1JLG03-f tumor? Homo sapiens cON 


1.000 


459169 


A1905517 




gb:RC-BT091-210199-105 BT091 Homosapien 


0.773 


400300 


X03363 




HER2 receptor tyrosine kinase (c-erb-b2, 


468.462 


440012 


AA861072 




gb:ak32e05.s1 Soares_testis_NHT Homo sap 


0.002 


412799 


AI267608 




gbaq91h03j(1 StanleyFrontai SB pool 1 


0.010 


412964 


BEO 19688 




gb:bb28g08.x1 NIH_MGC_5 Homo sapiens cDN 


0.003 


406992 


S82472 




gbrbela -pof=DNA polymerase beta {exon a 


0.005 


414969 


C16195 




gb:C16195 Qontech human aorta poJyA mRN 


0.023 


413158 


BEO68098 




gb:CM1-BT03«H)6l299-060-c09 BT0368 Homo 


0.007 


453823 


All 37967 




gb:DKFZp761D2315_r1 761 {synonym: hamy2) 


0.994 


453846 


All 57586 




gb:DKFZp761H0216 rl 761 {synonym: hamy2) 


0.004 


407055 


X89211 




gb:H.sapiens DMA for endogenous retrovir 


0.037 


415204 


T27434 




gbJibc2294 Human pancreatic islet Homo s 


76.500 


434572 


Af 1 47340 




gb:Homo sapiens fuD length insert cONA 


0.030 


438990 


AF085690 




gb:Homo sapiens full length insert cONA 


1.000 


439780 


AL109688 




gb:Homo sapiens mRNA fuD length Insert 


0.256 


413671 


Z43712 




gb:HSC1JA121 normalized infant brain cDN 


0.009 


406974 


M57293 




gb:Human parathyroid hormone-related pep 


0.004 


455797 


RF091833 




gb:IL2-BT0731-260400-076-F04 BT0731 Homo 


2.616 


455807 


BE 141 140 




gb:MRO-HT0075-021299-006-d07 HTTJ075 Homo 


a413 


432189 


AA527941 




gbmh30c04.s1 NCLCGAP_Pr3 Homo sapiens 


0.015 


443309 


AJ821874 




gbmt58f 1 0jt5 NQ_CGAP_Pr3 Homo sapiens 


0.007 


437240 


AA747537 




gb:nxB5c05.s1 NCLCGAP_GCB1 Homo sapiens 


0.006 


455189 


AW864176 

Ail W*t 1 t V 




gb:PM0-SN0O14-2604Q0-0O2-b08 SN0014 Homo 


0.069 


444163 


AM 36038 




gb:qc54g07jc1 Soares_placenta_8to9weeks_ 


394.282 


HDD\ IV 






gb:QVO-CT03B7-180300-167-h07 CT0387 Homo 


0.757 


454789 


BE156314 




gb:QV0-HT0367-15020O.114-d02 HT0367 Homo 


1.000 


433005 


AW939074 




gfeQV1^T006&O10200-057-c12 DT0069 Homo 


0.013 


455380 


BE160188 




gb:QV1-HT041^01020tW5^g05 HT0413 Homo 


0.249 


455650 


BE064655 




gb:RC1-BT031 3-301 29941 2-c09 BT031 3 Homo 


1.000 


436383 


BE065178 




gb:RC1-BT031 4-020200-01 2-ti01 BT0314Homo 


1.000 


413100 


8E065208 




gbKC1-BT0314-31030(W)15^09 BT0314 Homo 


•271.372 


428436 


BE080180 




gb:RG4-BT0629-120200-01 1-b10 BT0829 Homo 


0.002 


455831 


BE144966 




gb:RCm0187-201099-Q31<04 HT0187 Homo 


0.011 


434414 


AJ798376 




gbrtr34b07jc1 NCLCGAP_Ov23 Homo sapiens 


293.654 


414221 


AW450979 




gb:Ut-H-Bl3-ala-a-12-0-Ul.s1 «CLCGAP_Su 


0.807 


409488 


AW402825 




gb:Ul-HF-BK0-aac r oMJW)-Ul.r1 NiH_MGC_36 


0.965 


437938 


AJ950087 




gbwq05cO2j(1 NCLCGAPJ0d12 Homo sapien 


2952 


451385 


AA017656 




gb:ze39h01.r1 Soares retina N2MHR Homo 


7.341 


449325 


AA001162 




gb2e48b06.r1 Soares retina N2MHR Homo 


0.004 


413316 


W91931 




gfezh47c01 s\ Soares Jetaljver_spleen_ 


0.004 
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401016 


0.342 


401335 


0.256 


401565 


1.000 


401760 


301.372 


401781 


247.141 


401961 


1.722 


402239 


5.180 


402305 


0.917 


402424 


551.141 


402777 


153.231 


402778 


0.006 


402837 


0.367 


402948 


154.103 


402952 


17.038 




0.196 


403297 


12.744 


403637 


0.304 


403657 


0.032 


404136 


0.008 


404249 


0.065 




1.1UO 


404917 


69.590 


404983 


1.000 


405238 


1.000 


405364 


294.141 


405531 


1.747 


405601 


145.551 


405621 


0.224 


405932 
406117 


1.968 
0.333 


406354 


1.000 


406548 


0.002 


406599 


0.010 


459702 AI204995 


0.449 


TABLE 2B 





Ptey: Unique Eos probeset identifier number 
CAT number Gene duster number 
Accession: Genbank accession numbers 



Pkey 

409488 

412799 

412964 

413100 

413158 

413316 

413671 

414221 

414969 
415204 
421934 
424040 
428436 

431322 
432189 
433005 
433108 
434414 



434572 
436383 
437240 
437938 



CAT Number 
1134791J 
132817 1 
1339278 1 
1349119 1 
1351251 1 
1360169 1 
1382504 1 
142696.1 

1510393.1 
1529407.1 
209339.1 
234659.1 
291472.1 

331543J 
342819.1 
357346.1 
35895.1 
38585.1 



38911.1 
41888.1 
435139.1 
44573.2 



Accession 

AW402825BE544338 

A1267606AA121045AA126521 

BE019688BE144460 

BE065208 BE065224 6E065168 BE065313 

BEO68098 BE068119 BE068083 BE068088 BE068120 BE068155 BE068111 
W91931 W94979 8E081744 

Z43712 BE156729 BE 1 56538 BE1 56731 BE156673 BE156539 BE156674 BE156430 BE156672 BE156675BE156432 BE156541 

AW450979 AA136653 AA1 36656 AW41 9381 AA984358 AA492073 BE168945 AA809054 AW238038 BE01 1212 BE01 1359 BE011367 

BE01 1368 BE011362 BE0J 1215 BE011365 BE01 1363 

C16195C16230C16211 C16164C16251 

T27434 Z25288 F00323 D82802 R85077 

AA300625 R16859 R 16860 AW898335 W24337 

AA334400 AA334257 AW966124 

BE080180 AW827313 AW231970 AA995028 AM28584 AW872716 AW892508 AW854593 AA578441 AW975234 AA664937 AA984131 AA528743 
AA552874 AA564758 AW063245AI267534AW070190 AW893483 AA770330 AA906928 AA906582 AA758746 AA551 717 AW06331 1 AA429538 
AW970622 AA503009 AA502998 AA502989 AA502805 T92188 
AA527941 A1810608 M620190AA635266 

AW939074 AW939073 BE1 60476 AW939938 AW939206 AW94001 2 AW939076 AA573577 AW750479 AA574383 AW970057 
AB002446T03146 

AI798376 S46400AW811617AVV811616WW)557BE142245AVV858232AW861851AW858362 AA232351 AA218567AA055556AW858231 
AW857541 AW8141 72 H66214 AWB14398 AF1 34164 AA243093 AA173345 AA199942 AA223384 AA227092 AA227080 T12379 AA0921 74 
T61 139 AA149776 AA699829 AW879188 AW813567 AW813538 AI267168 AA157716 M157719 AA100472 AA100774 AA130756 AA167705 
AA157730AA1 57715 AA053524 AW849581 AW854566 C05254 AW882836 T92637 AW812621 AA206 583 AA209204BE1 56909 AA226824 
AI829309 AW991957 N66951 AA527374 H66215 AA045564 A1694265 H60808 AA 1 49726 AW1 95620 BE081333 BE073424 AW817662 
AW817705AW817703AW817659BE081531 H59570 
AF147340T51948T52029 
BE085173AJ227879 
M747537 BE089068 BE089070 

A1950087 N70208 R97040 N36809 A13081 1 9 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 AA908598 AA251875 
AI820501 A1820532 W87891 TB5904 U71456 T82391 BE328571 T75102 R34725 AA8B4922 BE328517 AJ21 9788 AABB4444 N92578 F13493 
AA927794 AI560251 AW8740SB AL1 34043 AW23 5363 AA663345 AW008282 AA488964 AA2831 44 Al 890387 A1950344 AI741 346 A3 689062 
AA28291 5 AW1 02898 AI8721 93 AI763273 AW1735B5 AW150329 A1653832 AI762688 AA988777 AA488892 A1356394 AW10381 3 AI539642 
AA642789 AAS56975 AW50551 2 A1961 530 AW629970 BE6 12881 AW276997 AW513601 AW512843 AA044209 AW856538 AA1 80009 AA337499 
AW961101 AA251669 AA251 874 A!3 19225 A W205862 AJ 583338 AI858509 AW276905 AI633006 AA972584 AA908741 AW072629AW5 13996 
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AA293273 AA989759 N75B28 N22388 HB4729 H60052 T92487 AIQ22058 AA78041 9 AA551005 W80701 AW513456 AI373032 AI564269 F00531 
HB3488 W37 181 W78802 R68056 AI002839 R67840 AA300207 AW959581 T53226 F04005 



438990 


46760 1 


AF085890K29949 H29856 


439780 


47673 1 


AL1 09688 R23665 R26578 


440012 


483290J 


AA861072H02819R25946 


443309 


56604.1 


A1821 B74 A182186B AA630932 AA653B97 AA650103 AI821 1 31 AI821 1 24 


444163 


593658.1 


AI126098A1184746A1148521 


449325 


BO480J 


AA001 162 AA018950 AA017505 R84446 


451385 


86787J 


AA017656AA017374AA019761 


453823 


982526.1 


AL137967 BE064160 BE064186 


453846 


983043.1 


AL157586AL157590 


454789 


1234742.1 


BE156314 BE156316 AW820750 


455170 


1256906.1 


AW860972 AW862598 AW862599 AW860988 AW860983 AW860898 AW860925 AW860922 AW860986 


455189 


1259271.1 


AW8641 76 AW8641 33 AW8641 85 AW8641 37 


455380 


1287679.1 


BE160188AW935785BE160401 BE160319 BE160313 BE1B0395 


455650 


1348720.1 


BE064655 BE153953 


455797 


1366826 J 


BE091B33 BE091874 BE091871 


455807 


1370914.1 


BEt41140 BE141139BE141105 BE141143 BE141127 BE141202 BE141108 


455831 


1373969.1 


BE144966 BE144957 BE144958 


458829 


773443.1 


AI557388BE158936 


459169 


920641.1 


AI90551 7 AJ905455 A1905452 



TABLE 2C 



Pkey: Unique number corresponding to an Eos probesel 

Reft Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gi) numbers. "Dunham L el al." refers to the publication entitled "The DNA 

sequence of human chromosome 22." Dunham I. et at., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted . 

reposition: Indicates nucteotide positions of predicted exons. 



Pkey 


Ref 


Strand 


NLposilion 


401016 


8117441 


Plus 


126234-126359,128050-128236 


401335 


9884881 


Plus 


15736-16352 


401555 


8099284 


Minus 


162520-162657 


401760 


9929699 


Ptus 


8312W3250,B5320-85540.94719-95287 


401781 


7249190 


Minus 


83215-83435,83531-83656,8374(W3901,B4237-84393.84955-85037,8629(W6814 


401961 


4581193 


Minus 


124054-1 24209 


402239 


7690131 


Pius 


38175-38304,42133-42266 


402305 


7328724 


Ptus 


40832-41362 


402424 


9796344 


Mnus 


64925-65073 


402777 


9588235 


Rus 


126786-126948 


402778 


9588235 


Plus 


128560*126702 


402837 


9369121 


Minus 


2013-2186,957^9758.11138-11^19429-19677,212^ 


402948 


9368458 


Minus 


143456-143626,143808-143935 


402952 


9408724 


Mums 


119452-119619 


403142 


9444521 


Plus 


89286-90131 


403297 


8096824 


Minus 


16584-17264 


403637 


8671936 


Minus 


142647-142771,145531-145762 


403657 


8843996 


Minus 


156223-156370 


404136 


6981900 


Minus 


4253846428 


404249 


8655533 


Plus 


64270^4633 


404875 


9801324 


Plus 


96588-96732.97722-97831 


404917 


7341851 


Plus 


4933049498 


404983 


4432779 


Mnus 


51178-51374,52000-52173 


405238 


7249119 


Minus 


51728-51836 


405364 


2281075 


fvfinus 


48325-48491,4913649252 


405531 


9665194 


Plus 


35602-35803 


405601 


5815493 


Minus 


147835-147935,149220-149299 


405621 


5523811 


Plus 


59362-59607 


405932 


7767812 


Minus 


123525-123713 


406117 


9142932 


Plus 


54304-54584 


406354 


9256049 


Minus 


2095-2377 


406548 


7711514 


Minus 


25138-26762 


405599 


8248616 


Plus 


10933-11086 
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Table 3A: Preferred therapeutic targets for bladder cancer 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnjgenelD: Unigene number 

Unigene Title: Unigene gene title 

HI: 90th percentile of bladder tumor Ais divided by the 90th percentile of norma) bladder sample Als 

R2: 90th percentile of bladder tumor Als divided by the 30th percentile of normal body sample Als 



Pkey 


ExAccn 


Unigene! D 


Unigene Title 


R1 


R2 


421948 


L42583 


Hs.334309 


keratin 6A 


14.20 


1.20 


439926 


AW014875 


Hs.1 37007 


ESTs 


11.31 


21.34 


413324 


V00571 


Hs.75294 


corticotropin releasing hormone 


9.15 


4575 


421110 


AJ250717 


Hs.1355 


cathepstn E 


9.07 


45.35 


417308 


H60720 


Hs.81892 


K1AA0101 gene product 


8.50 


1.99 


431211 


M86849 


Hs.323733 


gap junction protein, beta 2, 26kD {conn 


8.39 


1.28 


418406 


X73501 


Hs.84905 


cytokeratin 20 


8.10 


40.50 


446619 


AU076643 


Hs.313 


secreted phosphoprotein 1 {osteopofitin t 


7.98 


1.38 


433001 


AF217513 


Hs.279905 


done HQ0310 PRO0310p1 


7.67 


2.12 


408243 


Y00787 


Hs.624 


interlpukln 8 


7.56 


4.85 


417715 




Hs_8S366 


ESTs 


7.45 


4.70 


417720 


AA2D5625 


Hs 208067 


ESTs 


7.34 


9.18 


423673 




Hs.1695 


matrix metailoproteinase 12 (macrophage 


7.30 


26.07 


418007 


M13509 

IVI 1 


Hs 83169 


matrix metaDoproteinase 1 (interstitial 


7.12 


35.60 


413753 


U17760 


Hs 75517 


laminin, beta 3 (rucein (125kD), Icalinin 


6.95 


4.96 


441633 


AW958544 


Hs.1 12242 


normal mucosa of esophagus specific 1 


6.42 


0.89 


422168 


AA586894 


Hs 112408 


S1 00 calciunvbinding protein A7 (psorias 


6.08 


149 


407242 


M18728 




gb:Human nonspecific cross re acting antig 


5.96 


0.96 


405033 






C1 002652*:gi|544327|sp|Q04799]FMO5_.RABlT 


5.84 


16.22 


449230 


BE613348 


Hs.211579 


melanoma cell adhesion molecule 


5.82 


2.28 


406685 


M 18728 




gbiHuman nonspecific crcssre acting antig 


5.80 


0.89 


420159 


AI572490 


Hs 99785 


Homo sapiens cDNA; FU21 245 fis, clone C 


5.77 


28.85 


415511 


AI732617 


Hs.1 82362 


ESTs 


5.65 


28.25 


426028 


NM 001110 


Hs.1 72028 


a oTsintegrin and metaJloproteinase dome 


5.60 


6.51 


424008 


R02740 


Hs 137555 


nntativR du»mokinft racpninr GTP-hindina 


5.59 


Z33 


428651 


AF1Qfid7R 


no. i oohw i 


annaxtn A1 0 


5.55 


27.75 


*tUUO*W 






NM nmi(Tn* - Hnmn caniMts KMliltiwplatpri 
iKi¥i_vuj iuj .nujiiuoapjciio dMuumciaieu 


5.51 


4.92 


402230 








5.36 


21.44 


HOZ# Hi 


DC I3O03O 


ni.D 1 Huu 


tfi citnorfanwlv rtfonttv 1 MIR 


5.33 


2.80 


1IOUOD 




U q 70QQC 


proliferating cell nuclear antigen 


5.17 


1.98 


428450 


NM D147Q1 


Hs 184339 

no. itrtvwj 


K1AA0175 OPTtP. nmriiirt 


4l90 


£63 


418322 


rJ\£OH 1 DO 


no. oh 1 1 o 


rurfln-HftnAndpnl kinase inhiHtnr 1 1CDK. 


4.77 


2.35 




A9U9UO 


no. f «♦ IZO 


fath/ w»?H hinrftnn nmteln ft Hps! Inas. 
I ally dUU WllUUiy piUlcUl O, Ileal (j)S 


4.77 


3.71 


418663 


AK001100 


Hs 41590 

l lo.*t 1 U3U 


Waemncotltn 3 


4.74 


1.48 


414683 


S78296 


Hs 7688R 

IK>. / QOQO 


hvnnHiPticat nrofptn MRf!197fl? 


4.74 


2.92 


442432 


BE093589 


Hs 38178 


hvmHtPticst nmfpin FU23488 


4.68 


1.61 


424834 


AK001437 


Hs 153408 


Homn Romans cDNA FU 10570 fis done NT 


4.65 


11.63 


431958 


X63629 


Hs.2877 


cadherin 3, type 1, P-cadherin (placenta 


4.63 


2.06 






Up 1*19197 
no. 1 oc It/ 




4.55 


3.35 


401780 






NM 0Q55S7*'Homn saotens keratin 16 ffnea 


4.49 


1.62 




AWQ75531 


Hs.1 54443 


minMirfirrvMnnift mantpnanm rtflfn^ipnt ffi 

IllJlllUlliWUIUOUItIO IIIdUILClKJIIwO VOilMCJIl \W. 


4.43 


2.39 


401093 






C120(J0586*:gil6330167|dbj|BAA86477.1| (A 


440 


12.94 


417933 


X02308 


Hs.82962 


thymidytate synthetase 


4.35 


Z29 


418113 


AI272141 


Hs.83484 


SRY (sex determining region Y)-box 4 


4.32 


Z82 


412140 


AA219691 


Hs.73625 


RAB6 interacting, kinesin-Cke (rabkines 


4.24 


8.15 


401781 






Target Exon 


4.15 


1.31 


425234 


AW152225 


Hs.165909 


ESTs, Weakly simaar to I38022 hypotheti 


4.14 


7.39 


432842 


AW674093 


Hs.334822 


hypothetical protein MGC4485 


4.12 


2.51 


408380 


AF1 23050 


Hs.44532 


diublquffin 


4.11 


3.26 


449722 


BE280074 


Hs.23960 


cycDnBl 


4.09 


172 


420344 


BE463721 


Hs.97101 


putative G protein-coupled receptor 


4.07 


2.50 


422809 


AK001379 


Hs.1 21028 


hypothetical protein FU 10549 


4.00 


7.14 


404977 






Insutin-fike growth factor 2 (somatormdi 


3.89 


5.17 


400409 


AF153341 




Homo sapiens winged helix/forkhead trans 


3.88 


7.29 


429113 


D28235 


Hs.196384 


rrfostaglandin-endoperoxide synthase 2 (p 


3.87 


1.61 


444371 


BE540274 


Hs.239 


forkheadboxMI 


187 


2.75 


443171 


BE281128 


Hs.9030 


TONDU 


183 


9.48 


441362 


BE614410 


Hs.23044 


RAD51 (S. cerevisiae) homotog (EcoHRe 


3.82 


2.98 


439963 


AW247529 


Hs.6793 


platelet-activating factor acetythydrola 


177 


163 


407137 


T97307 




gb7e53h05.s1 Soares fetal fiver spleen 


173 


4.91 


426088 


AF038007 


Hs.166196 


ATPase, Qass 1, type BB, member 1 


168 


7.08 


443426 


AF098158 


Hs.9329 


chromosome 20 open reading frame 1 


168 


2.29 


414761 


AU077228 


Hs.77256 


enhancer of zeste (Drosophira) homotog 2 


167 


106 


413063 


AL035737 


Hs.75184 


chitmase 3-fike 1 (cartilage glycoprote 


167 


1.18 


421508 


NM_004833 


Hs.105115 


absent in melanoma 2 


3.67 


165 


443162 


T49951 


Hs.9029 


OKFZP434G032 protein 


3.66 
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418476 


U38945 


Hs.1174 


cyclln-dependenl Hnase inhibitor 2A (me 


3.66 


3.57 


417771 


AA804698 


Hs.82547 


retlnoic acid receptor res ponder (tazaro 


3.62 


Z73 


441495 


AW294603 


Hs.127039 


ESTs 


3.60 


Z71 


422282 


AF019225 


Hs. 11 4309 


apofipoprolein L 


3.57 


3.92 


417079 


U6559Q 


Hs 81134 


interteuWn 1 receptor antagonist 


3.55 


0.80 


417275 


X63578 


Hs.295449 


parv albumin 


3.54 


4.60 


4400% 


AK000517 


Hs.6844 


hypothetical protein FU20510 


3.52 


Z59 


418203 


X54942 


Hs 83758 


COC28 protein kinase 2 


3.50 


161 


400289 


X07820 


Hs.2258 


malrit materJoomfelnasB 10 fstromelvsin 


3.50 


17.50 


404875 






MM 022R1Q**Hnmo sanrens ohftsnhnlioase A2 


3.46 


3.24 


420005 


AW271106 


Hs 133294 


ESTs 


140 


Z22 


409757 


PfHM_vW 1050 


Hs 173114 




139 


Z93 


427719 


AI393122 


Hs 134726 


ESTs 


3.31 


Z51 


406690 


M29540 


Hs 2?fK?Q 


r^rrinrtpmiwnntr an!inon-/p|pfpH cell ad 


3.28 


0.42 


422283 


AW41 1307 


Hs 11d311 


CDC45 fcall division cvcte 45 Scerevis 


3.28 


2.62 


406081 






Target Exon 


3.25 


13.54 


426514 


HFG1G633 


Hs 170195 


bone morphogenetic protein 7 (osteogenic 


125 


Z46 


431009 


BE 149762 


Hs.48956 


gap junction protein, beta 6 (connexin 3 


123 


Z88 


429983 


W92620 


Hs.260855 


ESTs 


3.20 


Z84 


422158 


L10343 


Hs.1 12341 


protease inhibitor 3, skin-derived (SKAL 


3.18 


1.54 


426451 


A1908165 


Hs. 169946 


GATA-binding protein 3 (T-cell receptor 


3.17 


5.44 


411945 


AL033527 


Hs.92137 


v-myc avian myebcytomatosis vtraf ortcog 


3.13 


15.65 


415752 


8E314524 


Hs.78776 


putative transmembrane protein 


111 


Z46 


408633 


AW963372 


Hs.46677 


PRO2000 protein 


3.11 


3.30 


409956 


AW1 03364 


Hs.727 


inhibin, beta A (activln A, activin Afi a 


3.09 


1.52 


412420 


AL035668 


Hs.73853 


bone morphogenetic protein 2 


3.08 


Z22 


400297 


All 27076 


Hs.306201 


hypothetical protein DKFZp56401278 


3.05 


1Z49 


437931 


AI249468 


Hs. 124434 


ESTs 


3.01 


3.70 


421451 


AA291377 


Hs,50B31 


ESTs 


2.99 


14.95 


426682 


AV660038 


Hs.2056 


UDP glycosyltransferase 1 family, polype 


2.97 


Z10 


402239 






Tarofil Exon 


2.97 


3.37 


429345 


R11141 


Hs. 199695 


hypothetical protein 


2.96 


Z61 


435904 


AF261655 


Hs.8910 


1,2-aipha-mannosidas6 1C 


2.93 


Z13 


423961 


013666 


Hs.136348 


periostin (OSF-2os) 


2.93 


1.44 


420923 


AF097021 


Hs 273321 


differentially expressed in hematopoieti 


^92 


0.95 


436608 






down syndrome critical region protein DS 


Z92 


4.66 




AA256641 




PSTs Hiohlv similar k> S02392 rfnha-9-jTJ 


2.88 


Z93 


439223 




Hs 250618 


UL16 blndirta oroteln 2 


2.88 


Z15 








HrtmA Cflnlom kpniHn 17 "KRT17\ 


2.88 


3.44 




MVVZtQOUO 


Hs 27Q777 
na.t/ St ci 


FSTs* hnmnbviiip nf PFM-3 fpjnm cavkinvi 


2.8B 


14.30 


444444 


AJ149332 


Hs. 14855 


ESTs 


2.85 


Z68 


491 inn 

Hi l IUU 


mi JO 1003 


Hs 194fifif1 
na. izhoou 




2.84 


2.46 


411070 
'MlUfU 


AWiTlfllftd 


Rs 9AQ1RA 




2.78 


2.26 


4177RQ 








2.77 


2.34 


AAOQQA 






ESTs 


2.75 


2.82 


AAiVli 
*WtOOl 


□cJO/OOO 


Hs 3R3713 
ns.zojr io 


FCTs. WpaWv similar to S64054 hvtnlheH 


2.74 


2.44 


A1AAU7 




Uo 117RRA 




2.72 


137 




a i n3fti7n 

nUJOo 1 f u 


Hs fi07 ( iR 
na.ou/ 00 


hntpinf^hnmnrvstMnp nrpthvltransfnrasn 
uu lai i fen *ui l ic iitcuijiuoi iditucua 


2.69 


2.70 


4fli44n 






KIM Q2104ft*Hnmt ssoisns mfilsrwvns anKfipn 


2.69 


13.45 


400844 








2.69 


13.45 


426322 


tfKQfifl 


Hs.2012 


trsnscobsSsn^n 1 {^rftarnin B12 binding pr 


2.69 


1.36 


431448 


ai 1T7K17 

10/ O 1 1 


Hs 30R201 


hvnnthptral runt pi n nkF7nfiAdfl1 97R 


2.69 


1Z08 


403381 






ENSP0000023 1 844*: Eco tropic virus Integra 


Z68 


13.40 


411248 


AA55153B 

rVWy 1 wvV 


Hs. 334605 


Homo sapiens cDNA FU 14408 fis, done HE 


Z68 


Z43 


425206 


MM 002153 


Rs.155109 


hydroxysteroid (17*beta) dehydrogenase 2 


Z67 


Z68 


435099 


AC004770 


Hs.4756 


fipp structure-specrric endonudease 1 


Z67 


1.79 


409361 


MM 005982 


Hs. 54416 


sine ocuils homeobox (Drosophila) homolo 


2.65* 


0.73 


413281 


AABS1271 


Hs.222024 


transcription factor BMAL2 


Z65 


Z23 


446082 


AJ274139 


Hs. 156452 


ESTs 


Z65 


Z65 


422424 


AI1R6431 


Hs.296638 


prostate dirTerentiaton factor 


Z64 


Z68 


407839 


AA0d5144 


Hs. 161566 


ESTs 


2.64 


1.08 






Hs 163484 


ESTs 


2.64 


6.14 


417312 


AW888411 


Hs.250811 


leukemia-assotiated phosphoprotetn p18 { 


Z64 


1.31 


430157 


BE348706 


Hs.278543 


ESTs 


Z63 


Z58 


436481 


AA379597 


Hs.5199 


HSPC150 protein similar to ubiottfrKon 


Z61 


1.93 


416686 


235830 


Hs.87268 


annexin Afi 


Z60 


1.62 


430486 


BE062109 


Hs.241551 


chloride channel, caldum activated, tarn 


Z59 


Z87 


429276 


AF056085 


Hs.198612 


G protein-coupled receptor 51 


Z57 


189 


439738 


BE246502 


Hs.9598 


sema domain. Immunoglobulin domain (Ig), 


Z57 


Z49 


409632 


W74001 


Hs.55279 


serine (or cysteine) proteinase inhibito 


Z56 


1.43 


414812 


X72755 


Hs.77367 


monokine induced by gamma interferon 


Z54 


110 


451668 


Z43948 


Hs.326444 


cartilage acidic protein 1 


2.51 


3.60 


421379 


Y15221 


Hs.103982 


small inducible cytokine subfamOy B (Cy 


Z50 


Z91 


414416 


AW409985 


Hs.76084 


hypothetical protein MGC2721 


Z49 


1.78 


429612 


AF062649 


Hs.252567 


pituitary bmor-transtonriing 1 


Z49 


1.80 


416658 


U03272 


Hs.79432 


fibriirm 2 (congenital contracture) ara 


Z49 


146 


437553 


AJ829935 


Hs.130497 


ESTs, WeaWy similar to MAT8.HUMAN CHLOR 


Z48 


Z36 
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418941 


AA452970 


Hs.239527 


E1B-55kDa-associated protein 5 


2.46 


2.33 


414807 


AI738616 


Hs.77348 


hydroxyprostaglandin dehydrogenase 15-(N 


Z44 


Z49 


450983 


AA305384 


Hs.25740 


ER01 (S. cerevislae)-lixe 


2.43 


1.42 


407788 


BE514982 


Hs.38991 


S100 calcium-binding protein A2 


Z41 


2.05 


449019 


AI949095 


Hs.67776 


ESTs, Weakly slmBar to T22341 hypolheli 


2.40 


1.90 


417366 


BE1 85289 


Hs.1076 


small proline-rich protein 1B (comifin) 


2.40 


0.49 


420370 


Y13645 


Hs.97234 


uroplakin 2 


2.39 


3.81 


408000 


L11690 


Hs.198689 


bullous pemphigoid antigen 1 (230/240kD) 


2.38 


1.45 


406399 






NMJ)03 122*: Homo sapiens serine protease 


2.38 


4.20 


435563 


AF210317 


Hs.95497 


solute carrier family 2 (facilitated giu 


2.31 


4.05 


442117 


AW664964 


Hs.1 28899 


ESTs; hypothetical protein for IMAGE:447 


2.31 


1.17 


436246 


AW450963 


Hs.1 19991 


ESTs 


2.30 


11.50 


433078 


AW015188 


Hs.121575 


Homo sapiens cDNA FU 12231 fis, done MA 


2.30 


2.40 


424012 


AW368377 


Hs 137569 


tumor protein 63 kDa with strong homoiog 


2.29 


1.89 


411263 


BE297802 


Hs.69350 


Wnesin-Cke 6 (mitotic centromere- as soc 


2.28 


2.14 


432829 


W60377 


Hs 57772 


ESTs 


2.28 


4.85 


415025 


AW2070S1 


Hs.72307 


ESTs 


2.28 


11.40 


436293 


AIG01188 


Hs 120910 


ESTs 


2.27 


3.80 


415989 


A! 267700 




ESTs 


2.27 


11.35 


418067 


A11 27958 


Hs.83393 


cystatin E/M 


2.25 


1.54 


436291 


BE568452 


Hs 344037 


protein regulator of cytokinesis 1 


Z25 


2.45 


422278 


AF072873 


Hs.114218 


frizzled (Drosophila) homoiog 6 


2.25 


1.55 


428479 


Y00272 


Hs.334562 


cell dh/ision cycle 2, G1 to S and G2 to 


Z22 


11.10 


443247 


BE614387 


Hs.333893 


c-Myc target JP01 


Z21 


1.32 


424364 


AW383226 


Hs.201189 


ESTs, Weakly similar to G01763 atrophln- 


Z21 


1.52 


405932 






Cl5000305:gi|3806122lgblAAC69198.1l (AF0 


2.20 


1.55 


419741 


NMJJ07019 


Hs.93002 


ubiquitin carrier protein E2-C 


2.19 


1.99 


423271 


W47225 


Hs.126256 


interleukin 1, beta 


Z19 


Z01 


402305 






C1 9000735*:gil4508027]rBfl NP_0034 14.1 1 z 


2.19 


Z54 


421064 


A1245432 


Hs.101382 


tumor necrosis factor, alpha-induced pro 


Z19 


Z16 


427747 


AW411425 


Hs.180655 


serine/threonine kinase 12 


Z18 


1.80 


437181 


AI306615 


Hs.125343 


ESTs, Weakly similar to KIAA0758 protein 


Z17 


10.85 


431890 


X17033 


Hs.271986 


folegrin, alpha 2 (CD498, alpha 2 subuni 


Z17 


Z14 


433437 


U20536 


Hs.3280 


caspase 5, apoptosis-related cysteine pr 


Z16 


1.37 


407581 


R48402 


Hs.1 73508 


P3ECSL 


Z15 


1.95 


400845 






NMJXHIOSVHomo sapiens sortifirwelaied 


Z15 


Z23 


419359 


AL043202 


Hs.90073 


chromosome segregation 1 (yeast homoiog) 


Z14 


1.70 


418526 


BE019020 


Hs.85838 


solute carrier family 16 (monocarboxylic 


Z13 


1.83 


448045 


AJ297436 


Hs.20166 


prostate stem cell antigen 


Z13 


3.49 


453459 


BE047032 


Hs.257789 


ESTs 


Z13 


Z30 


419183 


U60669 


Hs.89663 


cytochrome P450, subfamily XXIV (vitamin 


Z12 


10.60 


436251 


BE515065 


Hs.296585 


nucleolar protein (KKE/D repeat) 


Z11 


1.89 


445911 


AJ985987 


Hs.145645 


ESTs. Moderately similar to ALU1_HUMAN A 


Z11 


2.57 


420876 


AA918425 


Hs.1 77744 


ESTs' 


Z09 


Z30 


438817 


A1023799 


Hs.1 63242 


ESTs 


Z09 


10.45 


434293 


wu 004445 


Hs.3796 


EphB6 


Z08 


Z42 


422765 


AW409701 


Hs.1578 


baculoviral LAP repeat-containing 5 (sur 


Z08 


Z10 


418216 


AA662240 


H$J83099 


AFtSq 14 protein 


Z08 


4.62 


437915 


AI637993 


Hs.202312 


Homo sapiens clone N1 1 NTera2D1 teratoca 


Z08 


10.40 


409420 


215008 


Hs.54451 


laminin, gamma 2 (nicein (100kD), katln) 


Z05 


5.39 


431441 


U81961 


Hs.2794 


sodium channel, nonvoltage-gated 1 alpha 


Z05 


1.20 


400773 






NM_003105*:Homo sapiens sortiBn-related 


Z05 


1.78 


414987 




H«i5Q4fl5? 


hypothetical protein FU 14950 


2.04 


Z06 


424687 


J05070 


Hs.1 51738 


matrix metaOoproteinase 9 (gelatinase B 


Z04 


1.70 


444476 


AF020Q38 


Hs.1 1223 


isocitrale dehydrogenase 1 (NAOP), solub 


Z03 


Z26 


447437 


U07225 


Hs.339 


purinergic receptor P2Y, G-protein coup) 


Z02 


1.73 


445537 


AJ245671 


Hs. 12844 


EGF-fike-domain, multiple 6 


Z02 


Z93 


407601 


AC002300 


Hs.37129 


sodium channel, nonvoltage-gated 1, beta 


Z02 


1.55 


441801 


AW242799 


Hs.86366 


ESTs 


Z01 


10.05 


439780 


AL 109688 




gbiHomo sapiens mRNA full length insert 


ZOO 


10.00 


452732 


BE300078 


Ks.80449 


Homo sapiens, clone IMAGE:3535294, mRNA, 


zoo 


1.53 


434876 


AF160477 


Hs.61460 


Ig superfamiiy receptor LN1R 


zoo 


1.91 


430152 


AB001325 


Hs.234642 


aquaporin 3 


1.99 


1.74 


453134 


AA032211 


Hs.118493 


ESTs 


1.99 


3.16 


412719 


AW016610 


Hs.816 


ESTs 


1.99 


0.34 


442577 


AA292998 


Hs.1 63900 


ESTs 


1.99 


3.09 


409402 


Af 208234 


Hs.695 


cystatin B (stefin B) 


1.98 


1.50 


414774 


X02419 


Hs.77274 


plasminogen activator, urokinase 


1.97 


1.84 


439318 


AW837046 


Hs.6527 


G protein-coupled receptor 56 


1.95 


1.42 


447334 


AA515032 


Hs.91109 


ESTs 


1.95 


Z53 


432015 


AL157504 


Hs.159115 


Homo sapiens mRNA; cDNA OKFZp586O0724 (f 


1.94 


9.70 


429002 


AW248439 


Hs.2340 


function plakoglobin 


1.94 


1.57 


410553 


AW016824 


Hs.255527 


hypothetical protein MGC14128 


1.94 


Z02 


420783 


AI659838 


Hs.99923 


tecfa'n. gaiactoside-btndtng, soluble, 7 


1.93 


0.56 


407811 


AW190902 


Hs.40098 


cysteine knot superfamOy 1, BMP antagon 


1.92 


1.01 


448988 


Y09763 


Hs.22785 


gamma-aminobutyrfc acid (GABA) A recepto 


1.92 


1.47 


400303 


AA242758 


Hs.79136 


UV-1 protein, estrogen regulated 


1.92 


1.59 
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414918 


A1219207 


Hs.72222 


hypothetical protein FU 13459 


1.92 


£77 


424522 


AH 34847 
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407846 


AA426202 


Hs.40403 


Cbp/p300-interacting transactivator, wit 


0.80 


0.57 


457292 


AI921270 


Hs.281462 


hypothetical protein FU14251 


0.77 


1.40 


431089 


BE041395 




ESTs, Weakly sirriar to unknown protein 


0.76 


14.88 


459702 


A1204995 




gb:an03c03.x1 Stratagene schizo brain S1 


0.74 


11.03 


424503 


NMJJ02205 


Hs.149609 


integrin, alpha 5 (fibronectin receptor, 


0.56 


0.80 


406964 


M21305 




gb:Human alpha satellite and satellite 3 


0.53 


11.16 


416225 


AA577730 


Hs.188684 


ESTs, Weakly similar to PC4259 ferritin 


0.48 


3.01 


400288 


X08256 


Hs.149609 


integrin, alpha 5 (fibronectin receptor, 


0.40 


0.70 


421218 


NM.000499 


Hs.72912 


cytochrome P450, subfam3y I (aromatic c 


0.40 


0.61 



Pkey: Unique Eos probes st identifier number 
CAT number. Gene cluster number 



Accession: 


Genbank acce; 


jsion numbers 


Pkey 


CAT Number 




411880 


1263110J 


AW^77BE088101T05990 


413804 


1390710J 


T64682BE1 681 90 BE1 68256 


414221 


14269SJ 


AW450979AA136653AA136656AW419381 AA984358 AA492073BE1 68945 AA809054AW2380388E011 212 BE01 1359 BE01 1367 






BE011368 BE011362 BE011215 BE011365 BE01 1363 


415989 


155454 J 


A1267700 AI720344 AA191424 Ai 023543 A1469633 AA172056 AW958465 AA172236 AW953397 AA355086 


431089 


327825.1 


BE041395 AA491826 AA621946 AA715980 AA666102 


431322 


331543 1 


AW970622 AA503009 AA502998 AA502989 AA502805 T92188 


432222 


343347 1 


A1204995 AW827539 AW969908 AW440776 AA528756 


435608 


42361.3 


AA628980 AI126603 BE504035 


439780 


47673J 


AL109688R23665 R26578 


444163 


593658 1 


AJ 1 26098 A1 1 84 746 AJ 1 48521 


445182 


632151 1 


AW169787 AI215430 AW268499AW205930A1392907 BE093017 BE093019 BE093010 


451844 


888230.1 


T61430 AI820546 AI821335 


456034 


142696.1 


AW450979AA136653 AA136656AW419381 AA984358AA492073 BE 168945 AA809054AW23B038BE011212BE011359 BE011367 






BE01 1368 BE0 11362 BE011215 BE011365 BE011363 



TABLE 3C __ 

Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers In this column are Genbank Identifier (Gl) numbers. "Dunham I. et at' refers to the publication entitled The DNA 

sequence of human chromosome 22/ Dunham I. et at.. Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 
NLposiOon: Indicates nucleotide positions of predicted exons. 



Pkey Ref Strand reposition 

400773 8131629 Minus 441 1 6-44238,48208-48321 
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WO 03/003906 



PCT/US02/21338 



400843 


9188505 


Plus 


5863-5970,7653-7784,8892-9023,9673-9807,10634-1 0789,1 5254-1 5403,23827-23958 


400844 


9188505 


Pius 


24746-24872,25035-25204 


400845 


9188605 


Plus 


34428-34612 


400846 


9188605 


Plus 


39310-39474 


401093 


B516137 


Minus 


22335-23166 


401747 


9789672 


Minus 


118596-1 18816,1 191 19-119244.119609-119761,120422-120990,130161-130381,130468-130593,131097-131258.131866- 
131932,132451-132575,133580-13401 1 


401780 


9929699 


Plus 


83126-83250,85320-85540,94719-95287 


401780 


7249190 


Minus 


28397-2861 7. 28920-29045,291 35-29296.2941 1 -29567, 29705-29787,30224-30573 


401781 


7249190 


Minus 


83215-83435,83531-83656i83740^3901,84237-64393 t 84955-85037,86290-86814 


402075 


B1 17407 


Plus 


121907-12035,122804-122921.124019-124161,124455-124610,125672-126076 


402230 


99S6312 


Minus 


29782-29932 


402239 


7690131 


Plus 


38175-38304,421 33-42266 


402260 


3399665 


Minus 


113765-113910,115653-115765,116808-116940 


402305 


7328724 


Plus 


40832-41362 


402424 


9796344 


Minus 


54925-65073 


402777 


9588235 


Plus 


126786-126948 


402901 


8894222 


Minus 


175426-175667 


403047 


3540153 


Minus 


59793-59958 


403381 


9438267 


Minus 


26009-26178 


403903 


7710871 


Minus 


101165-102597 


404403 


7272157 


Minus 


72053-72238 


404440 


7528051 


Plus 


80430-81581 


404875 


9801324 


Plus 


96588-96732,97722-97831 


404877 


1519284 


Plus 


1095-2107 


404977 


3738341 


Minus 


43081-43229 


405033 


7107731 


Minus 


142358-142546 


405932 


7767812 


Minus 


123525-123713 


406081 


9123861 


Minus 


38115-38691 


406399 


9256288 


Minus 


63448-63554 


406434 


9256651 


Minus 


17803-17931 


406467 


9795551 


Plus 


182212-182958 
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TABLE 4A: Preferred diagnostics for bladder cancer 

Pkey: Unique Eos probeset identifier number 

ExAocn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

Rl : 80th percentile of muscle-invasive bladder tumor (stags T2-T4) Als divided by the 80th percentile of exophytic non-invasive carcinoma (stage Ta) Ais 

R2 90th percentile of bladder tumor Als minus background divided by 90th percentile of norma) body sample Als minus background, where background equals 

the 15th percentile of a! sample Als 

R3 90th percentile of bladder tumor Als divided by the 90ih percentile of normal body sample Als 



Pkey 


ExAccn 


1 Ininpnotn 


llntnpnp Titlp 
unryciic hub 


R1 


R2 


R3 


423961 


D1 3666 


no. iJlxrKJ 


neib&tin I OSF -?cts) 

fSCJIUOUII \V/OI C\JOJ 


11.22 


1.40 


1.44 


446619 




Hs.313 


cpptpJpH nhMnhftfWiWn 1 frrcfpftryintift 
acuncu ^iiuopi(upiuus<ii i ^uoicu^ajmuh, 


8.40 


1.31 


1.38 




OtOOf OOO 


(To. ZOO/ tO 


P^T«t Wpaklv qimflar fn ^ fv40S4 hvralheti 
Co ib, vrcowy oiiiuicti 10 3Q*tvw** iiyjMuicu 


7.88 


2.08 


Z44 


408243 


Y00787 


Hs.624 


interleukin 8 


7.54 


Z86 


4.85 


413063 




Hs.75184 


nhitinase 34Tke 1 feartBanfl alveonrota 


6.14 


1.18 


1.18 


414020 


NM 002984 


Hs.75703 


small Inducible cytokine A4 (homologous 


5.32 


1.34 


1.36 


424247 


X 14008 


Hs.234734 


rysozyme (renal amyloidosis) 


5.27 


0.61 


0.57 


418007 


M13509 


Hs.83169 


matrix metalloprotelnase 1 (interstitial 


5.17 


7.47 


35.60 


422158 


L10343 


Hs.1 12341 


protease inhibitor 3, skin-derived (SKAL 


4.30 


1.75 


1.54 


446921 


AB0121 13 


Hs. 16530 


smaQ inducible cytokine subfamily A (Cy 


4.29 


1.35 


1.60 


413324 


V00571 


Hs 75294 


corticotropin releasing hormone 


4.20 


a27 


45.75 


412429 








4.00 


0.94 


0.93 


406636 


L12064 




gb:Homo sapiens (clone WR4.1 2VL) antMh 


3.66 


1.25 


(.25 


428330 


L22524 


Hs.2256 


malrTK mptsdnnm^flinaQp 7 /mnhilvsin 
MiouiA iiiciBiiU|/iuiatiiaoo r \iiiauii/oiii, 


3.58 


2.07 


2.26 


4flfifiH7 

€ +UOOOf 


M31126 




mstiTY mpt alloDrntplnARP 1 1 /stmiTiPlvsin 

III0U1A II R^laJIwpi UUSIIIPOw 1 1 \9U WillCIJOfH 


3.41 


4.37 


3.37 


422550 






mirmfihrittsir.flfKinfliifoH nmiain A 
IIDvlUltlHIItOJ ooouuiaiwJ piUlOJII *T 


3.09 


0.39 


0.40 


AinflR7 


ADOOOD 


Hs 750 


fihrilUn 1 /Marfan cvnrtmmft\ 


2.96 


0.44 


0.45 


1 / t}OO0 




Ue 704^9 


1IUJ1JII) £ \\MiTLfWUlci COnUfMMiial at d 


2.90 


1.94 


3.46 


414812 


X72755 


no. 1 1 v>u i 


mnnnkino inHi if^oH ho/ namma Jnforfomn 

■ IiUjIUauI9 IIIUIA/CU Uy UczJllJIJa UHdldVll 


ZBQ 


1.67 


3.10 


HtOOf O 


DCUUJUO*t 


no- 1093 


matriv mataHnnmfatnaep 49 fmonrnnhanp 

liRHIM IIICl£DIUpJUiOUIdOC 1 L. \\ \ USLA UjJI Id^C 


2.77 


5.62 


26.07 


421379 


Y15221 


no. iuo30t 


email irviiiHhln pvfnltlnp cnhfamilv H fHtf 

OllKUl UMUWUKJ UJlUIUllO OUMSIIUIJ D \ Vjf 


2.71 


2.26 


2.91 


*t£90UU 


A/0303 


Met TflQ.1 1 A 
no. £03 l it 


hnvahr^hinfi ftonaergi C mlntartin^ 

IraAGUJ oLI MLB 1 ^ICildavHl w, bjiUlabUIIJ 


2.40 


0.47 


0.37 




mVta 100/ 


no.Oi/OO 


nidogen 2 


2.34 


0.88 


0.85 




MrUO i «040 




Target 


2.33 


1.54 


2.12 


40/011 




vie 4nnQft 


cystoma kiwi supenoiiuiy i, otnr anioyoii 


Z10 


1.01 


1.01 


AftMQQ 
4UU£09 


au/ozu 


H5.ZZDO 


matrix metaQoproteinase 10 (stromelysin 


1.99 


3.19 


17.50 


A4A77A 
414/ /4 


vn94 1 q 


HS,//*/4 


plasminogen activstor, urokinase 




2.49 


1 ftd 


41)94 ZU 


i/iouuo 


Ue 

n5-044ol 


larrHnin, gamma l tnjcein (iuukl/;, Kairn 






3.03 


428227 


AA321649 


Us 


smau BiQUCiDte cyiOKine suwarruiy o 




i.yy 


Q CC 

0.00 


414476 


AAoUiobr 


U« 7C90JI 
HS.fO^/4 


cur-coniaining nouurHiKe exuaceuuia 


1 7R 
l./O 


0.33 


U.J 1 


431639 


AK000680 


U» OCff47C 


phosphoprotein associated with GEMs 


l./O 


1.10 


1.10 


421958 


A A 1C~H QC 

AAoo/loo 


Up mooifi 

Hs.iuyyio 


ras homolog gene family, member H 


1 75 


1.0 J 


1 97 

1.4/ 


425247 


NM_005940 


HS. 105324 


matrix metatloprotetnase 1 1 (stromelysin 


1.00 


0 AO 


1. 03 


AAAIAt 

444o4z 


NM_U14JS0 


Lie 10007 


similar to tysosonro- associated membrane 


1 57 
1.0/ 


2.02 


8.55 


421493 


□couu34i 


Ue lA/IOOfi 


ecioo Brrnaj-neuraJ conex [wim □ i d-ukb 


1.00 


1.57 


I.OO 


■IA7QQQ 


WUooUo 


Ue 110K7Q 


CQT e WaoVIu clmilar fn A^Qf)1 Q Hvnoin Kp 

co I a, woawy similar uj m*}su i y uynein ihj 


1.41 


(17Q 
v.ry 


u.o/ 


429344 


anjMa 
Ky4UOO 


Ue IQMTfl 

ns.iyyooo 


tnKSKtn hp) a P 

uiniutn, DB13 u 


1.00 


i.jy 


I.O** 








f\ fvi_Ut jl* □ □ . nui iio sapiens nypouiBucdi pros 


1.34 


1.34 


1.56 


412420 


AL035668 


Hs.73853 


bone morphogenefc protein 2 


1.31 


t63 


Z22 


423217 


NMJJ00094 


Rs.1640 


collagen, type VII, alpha 1 (epidermotys 


1.27 


zoo 


1.67 


424206 


NMJW3734 


Hs.198241 


amine oxidase, copper containing 3 (vase 


1.25 


0.16 


0.30 


415138 


C18356 


Hs.295944 


tissue factor pathway inhibitor 2 


1.23 


1.70 


4.34 


424479 


AF064238 


Hs.149098 


smoothelin 


1.19 


0.27 


tt47 


445537 


AJ245671 


Hs.12844 


EGF-ttke^omain, multiple 6 


1.17 


1.59 


Z93 


417079 


U65590 


Hs.81134 


interleukin 1 receptor antagonist 


1.16 


0.82 


0.80 


421634 


AA437414 


Hs.106283 


hypothetical protein FU10262 


1.16 


1.05 


1.05 


439569 


AW602166 


Hs.222399 


CEGP1 protein 


1.15 


Z01 


Z42 


431346 


AA371059 


Hs.251636 


ubiquitin specific protease 3 


1.10 


1.64 


1,52 


448901 


AK001021 


Hs.22505 


hypothetical protein FU10159 


1.10 


0.31 


0.31 


450983 


AA305384 


Ks.25740 


ER01 (S.cerevisiaeHike 


1.03 


1.51 


1.42 


422424 


A1186431 


Hs.296538 


prostate dirTerenbatton factor 


1.02 


2.77 


Z68 


458781 


AJ444821 


Hs.63085 


ESTs, Weakly similar to MPP3_HUMAN MAGUK 


1.00 


1.64 


5.45 


445413 


AA151342 


Hs.12677 


CG1-147 protein 


1.00 


1.51 


5.20 


432350 


NWL005865 


Hs.274407 


protease, serine, 16 (thymus) 


1.00 


1.55 


4.30 


403106 






C8000064*:gIl10432393|emb|CAC10283.1| (A 


1.00 


1.48 


4.24 


402075 






ENSP00000251056*Ptasma membrane catehrm 


1.00 


1.67 


395 


404860 






C1 003394*:gi|1 231 4 272|emb| CAC00591 . 1 1 (A 


1.00 


1.40 


3.90 


434037 


AF116601 




WW domain-containing oxidoreductase 


1.00 


1.58 


3.70 


405738 






aW0390*:gIl6014646lgb|AAF01438.1lAF187 


1.00 


1.3S 


Z95 


427585 


031152 


Hs.179729 


collagen, type X, alpha 1 (Schmld metaph 


1.00 


1.45 


1.60 


439898 


AW505514 


Hs.209561 


WAA1715 protein 


1.00 


1.28 


1.59 


452567 


D87120 


rfc29882 


predicted osteoblast protein 


1.00 


1.10 


1.31 


401271 






C90Q0559*:gi|1 2314195|ernbiCAB99338.1 1 (A 


1.00 


Z12 


1.00 


411339 


BE164598 


Hs.274251 


rrypothetical protein FU20375; WAA1797 


1.00 


Z05 


1.00 



162 



WO 03/003906 



PCT/US02/21338 



403005 






C21 000027*:gi|1817556|dbj|BAA1 3672.1 j (D 


1.00 


1.89 


1.00 


431146 


283850 




Human DNA sequence from PAC 82J1 1 and co 


1.00 


1.69 


1.00 


434939 


AF161422 


Hs.305567 


Homo sapiens HSPC304 mRNA, partial cds 


1.00 


1.86 


1.00 


431753 


X76029 


Hs.2841 


neuromedin U 


1.00 


1.82 


1.00 


419121 


AA374372 


Hs.89626 


parathyroid hormone-Glee hormone 


1.00 


1.69 


1.00 


435505 


AF200492 


Hs.211238 


lnterleuWn-1 homoiog 1 


1.00 


1.67 


1.00 


452401 


NMJJ07115 


Hs.29352 


tumor necrosis factor, alpha-induced pro 


1.00 


1.67 


1.00 


406397 






C16001447*:gi[12053709|emblCAC20419.1| ( 


1.00 


1.66 


1.00 


404488 






NM_030958*:Homo sapiens organic anion tr 


1.00 


1.56 


1.00 


441206 


BE552314 


Hs.131823 


ESTs, Weakly similar toTERA HUMAN [H^a 


1.00 


1.49 


1.00 


407853 


AA336797 


Hs.40499 


dlckkopf (Xenopus laevis) homoiog 1 


1.00 


1.47 


1.00 


446119 


D29527 


Hs.290931 


ESTs 


1.00 


1.47 


1.00 


406471 






Target Exon 


1.00 


1.44 


1.00 


402110 






C18000178:gi|11990779|embJGAC19649.1|(A 


1.00 


1.42 


1.00 


407911 


AF104922 


Hs.41565 


growth differentiation factor 8 


1.00 


1.40 


1.00 


404829 






C1002937*:giI7499208|plr||T20993 hypothe 


1.00 


1.37 


1.00 


421925 


S80310 


Hs.109620 


acidic epidkiymai glycoprotein-iike 1 


1.00 


1.26 


1.00 


406076 


AL390179 




Homo sapiens mRNA; cDNA DKFZp547P134 (fr 


1.00 


1.19 


1.00 


458622 


AA972412 


Hs.13755 


f-box and WD-40 domain protein 2 


1.00 


1.09 


1.00 


416018 


AW138239 


Hs.78977 


proprotein convertase subliBsin/kexin t 


1.00 


1.04 


1.00 


409357 


M73628 


Hs.54415 


casein, kappa 


1.00 


1.03 


1.00 


436684 


AW976319 


Hs.94806 


ATP-bindlng cassette, sub-family A (ABC1 


1.00 


0.84 


0.84 


43617B 


BE152396 


Hs.21590 


hypothetical protein 0KFZp564O0523 


1.00 


0.91 


0.80 


402522 






C1 000 568*^i|f2697965|dbjJBAB2 1801.1| (A 


1.00 


0.80 


0.67 


405735 






ENSP00000252164':WAA1578 protein (Fragm 


1.00 


0.66 


0.56 


401905 






ENSPO0000252232*:Sterol regulatory eteme 


1.00 


0.65 


0.52 


404152 






C6000931*:gi|9558454|dbj!BAB03398.1| (AB 


1.00 


0.58 


0.51 


418693 


AI750878 


Hs.87409 


thrombospondin 1 


1.00 


0.B5 


0.51 


451375 


AI792066 


Hs.283902 


Homo sapiens BAC done RP11-481J13 from 


1.00 


0.46 


0.38 


430132 


AA204686 


Hs.234149 


hypotheBcal protein FU20647 


1.00 


0.84 


0.33 


456983 


AI081687 


Hs.11355 


thymopoietin 


1.00 


0.61 


0.29 


438681 


AW384815 


Hs.149208 


WAA1555 protein 


1.00 


0.60 


0.28 


409038 


T97490 


Hs.50002 


small Inducible cytokine subfamily A (Cy 


1.00 


0.39 


0.19 


409196 


NMJW1874 


Hs.334873 


carboxypeptidase M 


1.00 


0.43 


0.13 


410023 


AB017169 


Hs.57929 


stit (Drosophila) homoiog 3 


1.00 


0.30 


0.12 


420674 


NMJJ00055 


Hs.1327 


butyrylchoi'm esterase 


1.00 


0.30 


0.08 


415165 


AW887604 


Hs.78065 


complement component 7 


1.00 


0.08 


0.06 


425545 


N98529 


Hs.158295 


Homo sapiens, done MGC12401, mRNA, com 


1.00 


0.10 


0.01 


448256 


BE614149 


Hs.20814 


CGi-27 protein 


0.96 


1.32 


1.55 


417389 


BE26C954 


Hs.82045 


mldkine (neurite growth-promoting factor 


0.95 


3.15 


Z34 


403214 






NM.016232*:Homosarierainlerleukin 1 re 


0.94 


1.63 


2.51 


414799 


AI752416 


Hs.77326 


insulin-Eke growth factor binding prate 


0.92 


1.87 


1.60 


405665 


U22961 


Hs.184411 


albumin 


0.92 


1.09 


1.03 


401519 






C15000476*:gil12737279|refjXP„012163.1} 


0.88 


1.46 


3.44 


417501 


AL041219 


Hs.82222 


sema domain, immunoglobufin domain (Ig), 


0.87 


0.41 


0.50 


409632 


W74001 


Hs.55279 


serine (or cysteine) proteinase inhibito 


0.85 


1.38 


1.43 


405494 






C2001837 # :gi|12697903|dbj1BAB21770.1| (A 


0.83 


1.46 


4.65 


444171 


AB018249 


Hs.10458 


small indudbfe cytokine subfamily A (Cy 


0.80 


0.91 


0.91 


439706 


AW872527 


Hs.59761 


ESTs, Weakly similar to DAP1.HUMAN DEATH 


0.79 


0.58 


0.43 


436396 


A1683487 


Hs.152213 


wingless-type MMTV integration site farni 


0.77 


1.47 


Z37 


426716 


NM_006379 


Hs.171921 


sema domain, Immunoglobulin domain (Ig), 


0.75 


1.13 


1.18 


431347 


A1133461 


Hs.251664 


insulin-like growth factor 2 (somatomedi 


0.68 


2.61 


1.87 


413753 


U17760 


Hs.75517 


taminin, beta 3 (nicein (129(0), kafinin 


0.68 


2.70 


4.96 


426322 


J05058 


Hs.2012 


r/anscobatamin 1 (vitamin B12 binding pr 


0.67 


1.50 


1.36 


426514 


BE616633 


Hs.170195 


bone morphogenetic protein 7 (osteogenic 
apoDpoprotein L 


0.56 


£05 


2.46 


422282 


AF019225 


Hs.1 14309 


0.55 


3.91 


3.92 


409757 


NMJJ01898 


Hs.123114 


cystatin SN 


0.53 


2.72 


Z93 


427450 


AB014526 


Hs.178121 


K1AA0526 gene product 


0.52 


1.34 


1.97 


414555 


N98569 


Hs.76422 


phosphoBpase A2, group IIA (platelets, 


0.50 


1.04 


1.05 


423774 


L39064 


Hs.1702 


inlerleukin 9 receptor 


0.49 


2.81 


6.46 


404977 






InsuGn-fiKe growth factor 2 (somatomedi 


0.28 


5.78 


5.17 


428336 


AA503115 


Hs.183752 


microserninoprolein, beta- 


0.21 


1.47 


1.56 


451668 


Z43948 


Hs.326444 


cartilage acidic protein 1 


0.18 


4.05 


3.60 


428651 


AF196478 


Hs.188401 


annexin A10 


0.17 


5.14 


27.75 


421110 


AJ250717 


Hs.1355 


calhepsin E 


0.12 


5.49 


45.35 



BILEiS 

Pkey: Unique Eos probeset identifier number 
CAT number Gene duster number 
Accession: Genbank accession numbers 



Pkey CAT Number Accession 

431146 32854 J Z83850 AA459717AW965384 AA333635 
434037 3791 8 J AF116601 AI110691 AF063566 



WO 03/003906 



PC17US02/21338 



TABLE 4C 

Pkey: Unique number corresponding to an Eos probesel 

Ref: Sequence source. The 7 digit numbers in this column are Genbank WemTfer (G!) numbers. "Dunham L et aJ." refers to the publication enlitted The DNA 

sequence of human chromosome 22/ Dunham L et aL, Nature (1999) 402:489495. 
Strand: Indicates DNA strand from which exons were predicted. 
NLposition: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


NLposiSon 


401271 


9797373 


Minus 


51292-61911 


401519 


6649315 


Pius 


157315-157950 


401905 


6671966 


Pius 


153965-154441,156599-156819 


402075 


8117407 


Plus 


121907-122035,122804-122921,124019-124161,124455-124610,125672-126076 


402110 


8131678 


Minus 


173889-174062 


402522 


9798493 


Plus 


20605-20731 


402727 


9211324 


Plus 


54596-54777 


403005 


5791501 


Minus 


16945-17053,20018-20403 


403106 


7331404 


Plus 


77162-77350,81338-81511 


403214 


7630945 


Minus 


76723-77027,79317-79484 


404152 


9884757 


Pius 


41111-41281,4549545716.47801-47910 


404488 


8113286 


Minus 


64835-64994 


404329 


6624702 


Minus 


4913-5093,7310-7469,9472-9621.9951-10082 


404860 


8979555 


Plus 


65852-66081 


404977 


3738341 


Nfinus 


4308143229 


405494 


8050952 


Minus 


70284-70518 


405735 


9931101 


Minus 


29854-29976 


405738 


9943998 


Plus 


4437045410 


405076 


9123123 


Phis 


89972-90319 


406397 


9256243 


Minus 


127317-127454 


406471 


9795566 


Plus 


87383475B9 
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TABLE SA: Genes unregulated in bladder cancer 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneiO: Unigene number 

Unigene Title: Unigene gen8 title 

R1 90th percentile of bladder tumor A!s divided by the 90th percentile of normal body sample Als 



Pkey 


ExAccn 




Untoene Tiila 


R1 


459702 




yu.aiiu juuo~a i ou cUaybiis biaiizu in out o i 


11.03 


437915 


A16370M 


Hs 202312 


Homo carv/yv: dons Nil NTera?Dl \erakocs 


10.40 


404917 






Tomot (-yah 


9.65 


401066 






C11000517''ail7293105tablAAF48490 11 fAE 


9.00 


447475 


AI380797 


He 1 5RQQ7 
no. i ju33£ 


ESTs 


8.92 


427335 


AA448542 


Hs.251677 


G antigen 7B 


8.65 


450061 


AJ797034 


Ms 346238 


ESTs 


8.35 


401335 






Tampt Fxnn 


7.95 


424264 


Qgrj400 


Hs 239388 


Human DMA sfimianra fmm rJnnfl RP1-304R14 


7.35 


409041 


AB033025 


Hs 50081 


Hypolhs&cai protein, XP_051 860 (K1AA119 


7.20 


436608 


AA628980 




down syndrome critical region protein DS 


4.86 


451950 


rtVY£3£0 1 ' 


Hs 913307 


ESTs 


4.45 


406542 






T1 0Aftn75fl*-nil1 J^RfiSWknlOQYOOl 179*57 HI 1 


3.73 


437931 


AJ249468 


Hs. 124434 


ESTs 


170 


443133 




Hs41179 


ESTs 


3.60 


434487 


AF143867 


Hs.337588 


ESTs, Moderately simitar b S65557 alpha 


3\ 37 


402239 








3.37 


443162 


T49951 


Hs.9029 


DKFZP434G032 oratein 


3.21 


403383 






Target Exon 


3.13 


438315 


R5G795 


He. 8241 9 


ESTs 


3.04 


452827 


MIOf 1 OO J 


Hs.55468 


ESTs 


3.01 


41 6225 


AA577730 


Hs 18SS84 

110. luuuut 


FRTs Weaklv simHar hi PC4259 feiriiin 


3.01 


402948 






NM 025206,'Hojdo sapiens hypolheOcat prot 


291 


47CKH» 
*t£330J 


W92620 


He 9fifl85«i 


ESTs 


2.84 




MM n09R4Q 


rib. 




2.78 


WW/1 


Dt34U474 




lorxneao oox ivn 


9 75 

C.1 3 


>HTnno 
41 (Vvi 


At niann 


nS.oU/30 


belaine-homocystBine methyltransfarase 


9 7fl 




AA1*»7Q11 


He 799fln 


CO 15 


2.70 


4131UO 


iuk i n/wici 

lNIVl__UUdl Do 


nS. 1 30 lua 


nyuroxysierDuj \ 1 1 -oeiaj oenyaroy enase c 


9 Rfl 

£.00 




WV*H10Uf 


ns.iTWi i 


rnfV!«; //voll rftukitnn mirJa Q rontuic 
UU043 \QSu UIVBtun CyCJo *K), O.CoicVIS 


2.62 




Kl 1 HI 


ns. lyyoyo 


nypoineucai proteui 


9 fi1 


AiAOOi 


Avv«Kjuyi y 




nk-i ii u Dn_do_o_io_n_] ti t?i ur*i r/iAP Cu 


LXM 


Af\t*>f\C. 






r > iQnnfT71C*>niliCftftn'>7lfBl>MP AA^yf 1 A 41 v 

UiyuuUf Jo .gi|49vowf|n^i^juuJ4i4.i| z 


9 tiA 
Z.04 


439RA9 
90&041 


mvo/iuyo 


He 114R99 


nypotflBucej proiBin iwjl^kjo 


251 


49771 Q 


AHQ1.199 


He 11477fi 
ns.iot/10 


CO IS 


251 


ISO/iff 






nh-I) ?J4Tn711-?fi04flflLn76-Po4 BT0731 Hnmn 


250 


41ARfV7 

*M«KJU/ 


AI7UlR4.fi 


He 7714 ft 

ns./ 1 wo 


nyuiOAjpiujiaijiaJKiiii ucnyuiuyt^i idoc io-^rM 


249 




Awnninfifi 

miUUWOO 


ns. locos/ 


T Jww 9 

1*VUA £ 


249 








Target Exon 


248 


417997 


AA41R1RA 


Hs.23017 


Umn somen* rDKIA' P1J99747 fis clonp K 


248 


A1C7G9 
<H3/3£ 


nrii4C7A 


He 7077 R 
ns.ro/ /o 


puiouve u djuiiia iiuioJic proicui 


O AG 


Hi 1^40 


AAD3 1300 


HS.O04OU3 


norrto sapiens cltv\ ruj hwo 115, oone nt 


243 




MM (XTlAAACL 
PlMJ WW* 3 


113.0/ yo 


tpnoo 


949 


433078 


AW015188 


Hs.121575 


Homo sapiens cDMA FL) 12231 6s, dona MA 


240 


425997 


AK000086 


Hs.165948 


hypothetical protein FU20079 


238 


418322 


AA264166 


Hs.84113 


cycfirwlependent kinase inhibitor 3 (CDK 


235 


452012 


AA307703 


Hs.279766 


ki resin family member 4A 


234 


445600 


AF034803 


Hs.12953 


PTPRF Interacting protein, binding prole 


233 


418941 


AA452970 


Hs.239527 


E1B-55kDa-associated protein 5 


233 


454609 


AW810204 




gb:MR4-ST0125^)21 1 994)1 7-d08 ST0125 Homo 


2.28 


444476 


AF020038 


Hs.11223 


isodtrate dehydrogenase 1 (NADP), sdub 


226 


420005 


AW271106 


Hs.133294 


ESTs 


222 


439826 


NMJJ14965 


Hs.6705 


KIAA1 042 protein 


222 


405531 






Target Exon 


221 


436569 


BE439539 


Hs.279837 


glutathione S-transferase M2 (muscle) 


218 


404394 






ENSP00000241075:TRRAP PROTEIN. 


217 


427479 


8E410092 


Hs.178471 


KlAA0798gen9 product 


217 


435904 


AF261655 


Hs.8910 


1,2-alpha-mannosidase IC 


213 


431620 


AA126109 


Hs.264981 


2'-ff-o!igoadenylate synthetase 2 (69-71 


212 


426682 


AV660038 


Hs.2056 


UDP grycosyttransferase 1 tamfly, porype 


210 


422765 


AW409701 


Hs.1578 


bacutoviral IAP repeat-containing 5 (sur 


210 


451385 


AA017656 




gb2e39h01 j1 Soares retina N2b4HR Homo 


209 


403477 






C3002160*:gif7662420|reflNP,055736M| Kl 


209 


417151 


AA194055 


Hs.293858 


ESTs 


208 


448262 


AWB80830 


Hs.186273 


ESTs 


207 


415192 


D17793 


Hs.78183 


atdo-keto reductase family 1, member C3 


204 


402994 






NM_002463*:Homo sapiens myxovirus (intlu 


204 
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10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



426053 


U68105 


Hs.172182 


poly(A}-binding protein, cytoplasmic 1 


2.02 


423271 


W47225 


Hs.126256 


interteuHn 1, beta 


Z01 


419741 


NM007019 


Hs.93002 


ubiquilin carrier protein E2-C 


1.99 


407581 


R48402 


Hs.173508 


P3ECSL 


1.95 


410197 


NM.005518 


Hs.59889 


34iydroxy-3-methy1glutary1-CMnzynie A sy 


1.95 


427122 


AW057736 


Hs.323910 


HER2 receptor tyrosine kinase (c-erb-b2, 


1.93 


436481 


AA379597 


Hs.5199 


HSPC150 protein sim3ar to ubiqulfin-oon 


1.93 


436251 


BE515065 


Hs.295585 


nucleolar protein (KKE/D repeal} 


1.89 


401961 






NM_02l628:Homo sapiens serine carboxypep 


1.86 


434042 


AI 589941 


Hs.8254 


Homo sapiens, Similar to tumor different 


1.85 


447532 


AK000614 


Hs. 18791 


hypothetical protein FU20607 


1.84 


418526 


BE019020 


Hs.85838 


solute carrier family 16 (monocarboxytlc 


1.83 


429612 


AFC62649 


Hs.252587 


pituitary tumor-transforming 1 


1.80 


422164 


NM_014312 


Hs.1 12377 


cortic al thymocyte receptor (X. laevis 


1.80 


422247 


U18244 


Hs.113602 


solute carrier family 1 (high affinity a 


1.79 


410407 


X66839 


Hs.63287 


carbonic anhydrase IX 


1.78 


412115 


AK001763 


Hs.73239 


hypothetical protein FL) 10901 


1.77 


414609 


Ai 434699 


Hs.77356 


transferrin receptor (p90, C071) 


1.75 


432210 


AI567421 


Hs.273330 


Homo sapiens, clone I MAG E:3 544662, mRNA, 


1.74 


427239 


BE270447 


Hs.174070 


ubiquitin carrier protein 


1.74 


459198 


A1086347 


Hs.151138 


ESTs 


1.74 


421066 


AU076725 


Hs.101408 


branched chain aminotransferase 2, mi toe 


1.71 


424687 


J05070 


Hs. 151738 


matrix meiailoproteinase 9 (gelatinase B 


1.70 


450663 


H43540 


Hs.25292 


ribonuclease HI, large subunit 


1.70 


417324 


AW265494 




ESTs 


1.67 


453883 


A1638516 


Hs.347524 


cofactor required for Sp1 transcriptkma 


1.66 


428000 


R35145 


Hs.291904 


accessory proteins BAP31/BAP29 


1.65 


450635 


AW403954 


Hs.25237 


mesenchymal stem cell protein DSCD75 


1.63 


423397 


NM 001833 


Hs.1 652 


chemokine (C-C motif) receptor 7 


1.62 


415440 


D83762 


Hs.78442 


SREBP CLEAVAGE-ACTIVATING PROTEIN 


1.62 


428028 


U52112 


Hs.182018 


intsrteukin-1 receptor-associated kinase 


1.62 


426783 


219084 


Hs.172210 


MUF1 protein 


1.62 


445937 


A1452943 


Hs.321231 


UDP-GakbetaGtcNAc beta 1,4* galactosyl! 


1.61 


445462 


AA378776 


Hs.288649 


hypothetical protein MGC3077 


1.60 


400965 






C11002190*:gt|12737279|refjXP_0121 63.1| 


1.59 


432269 


N&L002447 


Hs.2942 


macrophage stimulating 1 receptor (c-met 


1.59 


429578 


AJ 969028 


Hs.99389 


ESTs 


1.59 


449027 


AJ271216 


Ks.22880 


dipeptidylpeptidase 111 


1.59 


431840 


AA534908 


Hs.2860 


POU domain, class 5, transcription facto 


1.59 


417900 


BE250127 


Hs.82906 


CDC20 (cell division cycle 20, S. cerevi 


1.59 


429002 


AW248439 


Hs.2340 


junction plakogtobin 


1.57 


442410 


AW996503 


Hs.197680 


ESTs 


1.56 


407601 


AC002300 


Hs.37129 


sodium channel, nonvoltage-gated 1, beta 


1.55 


418543 


NMJJ05329 


Hs.85962 


hyafuronan synthase 3 


1.54 


424611 


NM.001421 


Ks.151139 


E74-0ke factor 4 (ets domain transcript 


1.54 


414732 


AW410976 


Hs.77152 


minichromoscme maintenance deficient (S. 


1.54 


408930 


AA146721 


Hs.334686 


hypothetical protein FU21588 


1.53 


448993 


AM71630 


Hs.8127 


K1AA0144 gene product 


1.52 


414053 


BE391635 


Hs.75725 


trans g eli n 2 


1.51 


433662 


W07162 


Hs.150826 


CATX-8 protein 


1.50 


432562 


BE531048 


Hs.278422 


DKFZP586G1 122 protein 


1.50 


402260 






NM_001436*:Homo sapiens fibriilarin (FBL 


1.48 


426127 


L36983 


Hs.167013 


dynamin 2 


1.48 


427557 


NMJJ02659 


Hs.179657 


plasminogen activator, urokinase recepto 


1.48 


418026 


BE379727 


Ks.83213 


fatty add binding protein 4, adipocyte 


1.47 


418960 


NMJJ04494 


Hs.89525 


hepatoma-derfved growth factor (high-mob 


1.46 


428293 


BE250944 


Hs.183556 


solute carrier family 1 (neutral amino a 


1.46 


432344 


A1476474 


Hs.248156 


ESTs 


1.46 


453449 


W16752 


Hs.32981 


sema domain, immunoglobulin domain (lg), 


1.46 


450690 


AA296696 


Hs.333418 


FXYD domain-containing ton transport reg 


1.46 


441940 


AW2J8115 


Hs.1 281 52 


ESTs 


1.45 


409893 


AW247090 


Hs.57101 


minichromosome maintenance deficient (S. 


1.44 


439318 


AW837046 


Hs.6527 


G protein-coupled receptor 56 


1.42 


422565 


BE259035 


Hs.1 16400 


singed (DrosophitaHike (sea urchin fas 


1.41 


428928 


BE40S838 


Hs.194657 


cadherin 1, type 1, E-cadhenn (epiflieB 


1.41 


445417 


AK001058 


Hs.12680 


Homo sapiens cDNA FU1019S fis, done HE 


1.39 


441565 


AW953575 


Hs.303125 


p53-induced protein PIGPC1 


1.37 


439180 


AI393742 


Hs.199067 


v-erb-b2 avian erythroblastic leukemia v 


1.35 


418399 


AF131781 


Hs.84753 


hypothetical protein FU12442 


1.32 


432636 


AA340854 


Hs^78562 


claudin7 


1.32 


439053 


BE2445B8 


Hs.6456 


chaperonin containing TCP1, subunit 2 (b 


1.32 


413762 


AW411479 


Hs.848 


FK505-blnding protein 4 (59kO) 


1.31 


453914 


NM_000507 


Hs.574 


fnjctose-1,6-bts phosphatase 1 


1.28 


430056 


X97548 


Hs.228059 


KRAB-associated protein 1 


1.24 


451524 


AK00H66 


Hs.26516 


hypothetical protein FL1 10604 


1.23 


431441 


U81961 


Hs.2794 


sodium channel, nonvoltage-gated 1 alpha 


1.20 


439863 


BE547830 


Hs.9408 


paired trnmunogbbuSn4ike receptor beta 


1.19 
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451541 


BEZ79383 


Hs.26557 


plakophOin 3 


1.16 


406906 


Z25424 




gb:H.sapiens protefn-serine/lhreonlnekl 


1.11 


429504 


X99133 


Hs.204238 


Opocafin 2 (oncogene 24p3) 


1.00 


414002 


NMJJ06732 


Hs.75678 


FBJ murine osteosarcoma viral oncogene h 


0.B6 


431369 


BE184455 


Hs.251754 


secretory leukocyte protease inhibitor ( 


0.61 



TABLE SB 

Pkey: Unique Eos probesel identifier number 
CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey 
414221 

417324 
436608 
451385 
454609 
455797 

TABLE 5c' 



CAT Number Accession 

142696 1 AW450979AA136653 AA136656 AW419381 AA984358 AA492073 BE1 68945 AA809054 AW238038 BE01 1 21 2 BE01 1359 BE01 1367 

BE011368 BE011362 BE01 1215 BE01 1365 BE011363 

166714.1 AW265494 AA455904 AA195677 AW265432 AW991605 AA456370 

42361.3 AA628980 A1126603 BE504035 

86787J AA017656 AA017374 AA019761 

1226517.1 AW810204AW810555AW810196AW810619AW810507 

1366826.1 BE091833 BE091874 BE091871 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers In this column are Genbank Identifier (Gl) numbers. "Dunham I. et al." refers to the publication entitled "The DNA 

sequence of human chromosome 22.' Dunham I. et al., Nature (1999) 402:469-495. 
Strand: Indicates DNA strand from which exons were predicted. 
NLposition: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


NLposition 


400965 


7770576 . 


Minus 


173043-173564 


401066 


8217435 


Ptus 


71448-71574 


401335 


smm 


Pfus 


15736-16352 


401961 


4581193 




124054-124209 


402239 


7690131 


Plus 


38175-38304,4213342266 


402260 


3399665 


Minus 


113765-113910,115653-115765,116808-116940 


402305 


7328724 


Plus 


40832-41362 


402948 


9368458 


Minus 


143456-143626,143808.143935 


402994 


2996643 


Minus 


4727-4969 


403383 


9438267 


Minus 


119837-121197 


403477 


9958251 


Ptus 


111834-112008 


404394 


3135305 


Minus 


37121-37205,37491-37762,4105341140,41322-41593.4177341919 


404917 


7341851 


Plus 


4933049498 


405531 


9565194 


Plus 


35602-35803 


406387 


9256180 


Plus 


116229-116371,117512.117651 


406542 


7711499 


Plus 


117335-118473 
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TABLE 6A: Genes uprequlated in bladder cancer 

Ptey: Unique Eos probeset idenBfter number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unsgene number 

Unigene Title: Unigene gene title 

R1 90lh percentile of bladder tumor Als divided by the 90th percentile of normal urothetium biopsy Ais 

R2 90th percentile of bladder tumor Als divided by the 90th percentile of normal urothelium bbpsy and normal bladder Als 



Pkey 




UnviPnpin 


llnlnpnp Tiflp 
uinyciic line 


R1 


R2 




AWniAB75 


ns. io/uui 


CO IS 


I 1 .0 ) 


11.31 


413324 


V00571 




rwHwilmnin rplpaeinn hnrmnnp 
wji uuuuufjui ictcooiiiy iiwiiiuiic 


9.15 


9.15 


421110 


AJ9W717 


Hs.1355 


CdUlcpSUl C 


9.07 


9.07 




now £\j 


He PI RQ? 


nJAAU iu i gene piuouct 


a en 

o>w 


a cn 
o.ou 


418406 


X73501 


Hs 84905 


pvtnkpraHn 90 


8.10 


8.10 


446619 


AU076B43 


Hs.313 


spcreled nhosnhnfwotein 1 (osteooontin 


7.98 


7.98 


433001 


AF217513 


Hs 27QQD5 


dons HOflTIOPROftHODl 


7.67 


7.67 


408243 


Y00787 


Hs.624 


intarteukin 8 


7.56 


7.56 


416065 


BE267931 


Hs 78996 


proflferating cell nuclear antigen 


7.17 


5.17 


425397 


J04088 


Hs 156346 


loooisomerase (DNA) 11 atoha f170kD) 


7.17 


8.24 


414163 


AW957446 


Hs.301711 


ESTs 


7.14 


4.62 


418007 


M13509 


Hs.83169 


matrix melalloptoteinase 1 (interstitial 


7.12 


7.12 


426125 


X87241 


Hs. 166994 


FAT tumor suppressor (Drosophila) homdo 


6.90 


3.38 


427337 


Z46223 


Hs 176663 


Fc fragment of IgG, low affinity lllb, r 


6.85 


4.98 


441633 


AW958544 


Hs. 11 2242 


normal mucosa of esophagus specific 1 


6.42 


6.42 


438091 


AW37!B)62 




nuclear receptor subfamily 1, group 1, m 


6.32 


6.32 


413053 


ALD15737 


Hs.75184 




6.09 


3L67 


414219 


W20O10 


Hs.75823 


ALL 1 -fused gens from chromosome 1q 


5.93 


4.68 


405033 






C1002652*tiil544327lsDlQ04799lFMO5 RABIT 


5.84 


5.84 


413132 
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AW384371 AA852212 R58704 AW366566AW364859 AA025851 M025852 M4551IX) M7 19958 AW3^2220AW996245 BE1 65351 BE073467 
AA377127 AW890264 AW609750 AW391912 AW849690 T87267 AW853812 AA852213 W74149 BE009090 AA056401 H9101 1 AW368529 
AW390272 C18467 AW674920 N57176 AA026480 AW576767 H93284 AA026863 AW1 77787 AA026654 AW1 77786 8E092134 BE092137 
BE092136 AW177784 AI022862 BE091653 AW376811 AW848592 AAO40018 BE185331 BE182164 AA368564 AW951576 T29918 AA131077 
W95048 W2545B AW205789 H90899 N29754 W32490 R20904 BE167181 8E167165 N84767 H27408 H30146 AI190590 C03378 AJ554403 
A1205263 AA128470 AJ392926 AF139065 AW37081 3 AW370827 AW798417 AW798780 AW798883 AW798569 R33557 AA149190 C03029 
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AW177783 AA0888B5 AW370829 AA2476B5 BE002273 AI76081 6 AI43910I AW879451 AI700963 AA451923 AI340326 AI590975 T48793 
A1568096 Al 142B82 AA039975 A1470146 AA946936 BE067737 BE067786 W19287 AA644381 AA702424 AI41 7612 AI306554 AI686869 
AI568892 AW190555 AI571075 AI220573 AA056527 AI471874 AI304772 AW517828 AI915596 A1527383 A3270345 AW021347 AW1 65807 
AW105614 AI346078 AA552300 W95070 AI494069 AI91 1702 AA149191 AA026864 A1830049 AI887258 AW780435 A1910434 A1819984 
AI858282 AI078449 AI025932 AIB60584 AI635B78 AA026047 AA703232 D12062 AW192085 AA65B154 AW514597 AW591892 T87181 
AA782066 AW243815 AW1 50038 AW268383 AW004633 AI927207 AA782109 AW473233 A1804485 AW169216 AI572669 AA602182 
AW015480 AW771865 AI270027 AA95181 6 AA283207 AI076962 AI498487 AI348053 Af783914 H44405 AW7991 18 AA1 28330 AA515500 
AA918281 W02156 A1905927 AA022701 W38382 R20795 T77861 AW860878 

433470 3672 1 AW950564 AA092457 T55890 D56120T92525 AI815387 BE182S08 BE182595 AW080238 M90657 AA347236 AW961686 AW176445 

AA304671 AW583735 T61714 AA316968 AI446615 AA343532 AA083489 AA488005 W52095 W39480 N57402 D82638 W25540 W52847 
082729058990 BE619182AA315188AA308636AA112474W76162AA088544 H52265AA301631 H80982AA113786BE620997 AW651691 
AA343799 BE613669 BE547180 BE546656 F11933 AA376800 AW239185 AA376086 BE544387 BE619041 AA452515 AA00180B AA190873 
AA180483 AA159546 F00242 A1940809 AI94O602 AI189753 T97653 T661 10 AW062896 AW06291 0 AW062902 AI051622 AI828930 AA1 02452 
A1685095 A1819390 AA557597 AA383220 A1804422 A1633575 AW338147 AW603423 AW606800 AW750567 AW510672 AI25Q777 AA083510 
AW629109 AW51 3200 AA921 353 A1677934A1 148698 Al 955858 AA1 73825 AA453027 AI027865 AW375542 AA454099 AA733014 A1591384 
R79300 R80023 AA843108 AA62605B AA844898 AW375550 AA88901 8 AI474275 AW205937 AI052270 AW3881 17 AW3881 1 1 AA699452 
AI242230 N47476 H3817B AA386621 AA1 13196 AA1 30023 H39740 T61629 A1885973AW083571 AA1 79730 AA305757 A1285455 N83956 
AA216013 AA336155 AW999959 T97525 AA345349 T91 762 AA771981 AI285092 A1591386 BE392486 BE385852 AA6B2601 AI682884 
AA345840 T85477 AA292949 AA932079 AA098791 D82607 T48574 AW75203B C06300 

438091 44964 1 AW373062T55662AI299190BE174210AW579001 H01811 W40186 R67100 Al 923886 AW9521 64 AA528440 AW898607 AW898616 

AA709126 AW898828 AW898544 AA947932 AW89B625 AW898822 AI276125 AJ185720 AW510698 AA987230 T52522 BE467708 AW243400 
AW043642 AI288245 All 86932 D52654 D55017 D52715 D52477 D53933 D54679 AI29B739 AJ146984 AI922204 N98343 BE174213 AA845571 
AJ813854 AI214518 AJ635262 AJ 139455 AJ707807 AJ698085 AW884528 AJ024768 AJ004723 AW087420 AI5651 33 N94964 AI268939 
AW513280 AI061 126 AI435818 AI859106 AI360506 A1024767 AA513019 AA757598 X56196 AA902959 AI334784 AI860794 AA010207 
AW890091 AW513771 AI951391 AI337671 T52499 AAB90205 AI640908 H75966 AA463487 AA358688 A1961767 A186B295 AA780994 
A19B5913 BE174196 AA029094 AW592159T55581 N79072 A1611201 AA91081 2 AI22071 3 AW1 49306 AJ75841 2 AA04571 3 R79750N76096 

TABLE 6C ; 

Pkey: Unique number corresponding to an Eos probeset 

Ref. Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al/ refers to the publication entitled "The DNA 

sequence of human ctirornosome 22/ Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 
NLposition: Indicates nucleotide positions of predicted exons. 



Ptey 


Ref 


Strand 


Nt position 


400750 


8119067 


Plus 


198991-199168,199316.199548 


400843 


9188605 


Plus 


5863-5970J653-7784 l 8892-9023 l 9673-9807,10634-10789,1 5254-15403,23827-23958 


401093 


8516137 


Minus 


22335-23166 


401192 


9719502 


Minus 


69559-70101 


401451 


6634068 


Minus 


119926-121272 


401780 


7249190 


Minus 


28397-28617.28920-29045,29135-29296,2941 1-29567,29705-29787,3022440573 


401781 


7249190 


Minus 


83215-83435,83531^656,83740^901,84237-84393.84955^5037.86290^6814 


402230 


9966312 


Minus 


29782-29932 


402727 


9211324 


Plus 


54596-54777 


404875 


9801324 


PJUS 


96588-96732,97722-97831 


404977 


3738341 


Minus 


43081-43229 


405033 


7107731 


Minus 


142358-142546 


405506 


6466489 


Plus 


80014-80401,8059341125 
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TABLE 7A: Genes downregulated in bladder cancer 

Ptey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number. Genbank accession number 

UnigenelD: Unigena number 

Unigene Title; Unigene gene liOe 

R1 90th percentile of norma) uroifieffum biopsy Als dlvfried by 75th percentile of bladder tumor Ais 

R2 901h percentile of normal urothellum biopsy and normal bladder Als divided by the 90th percentile bladder tumor Als 



Pkey 


ExAccn 


urttgeneiu 


Unigene Title 


R1 


R2 


4U0U lU 




C21000152:c^6226483lsp|Q52118|YMO3_ERWS 4.86 


2.49 


426796 


S78234 


U- 4 mine 

HS.1 72405 


cell division cycle 27 


4.28 


£.40 




AA577730 


MS. 1 00004 


ESTs, Weakly similar to PC4259 ferritin 


4.04 




4o3UUo 


AWz3oodi 


HS.Z7/11 


WoH-Hirschhom syndrome candidate 1-lik 


3.82 


£.00 


4U43i/ 






Target Exon 


3.78 


0 nn 


4Z040B 


vrtMcn 

AUJJOU 


ns.4 


afcohol dehydrogenase 1B (class I), beta 


264 


1.79 


413040 


AA9A417n 




gb:rtc05h02.s1 NCLCGAP_Pr1 Homo sapiens 


2.63 


3.42 


453180 




ns. 1 iiHf 0 


ESTs, Highly similar to T42626 secreted 


2.32 


3.24 




MM fWtftRI 


PIS. 1340/3 


WNT1 inducible signaling pathway protein 


Z20 


3.80 


4010*3 


AIQ1 TQM 
A131 /3U1 


ns.tU004i 


ESTs 


118 


3.69 


41 f U/D 


AW y/ 04 04 


Up 99fld/9 

nS.4«J044£ 


ESTs, Moderately similar to ALU7_HUMAN A 


Z03 


3.03 


4*0400 


loll ID 


nS.£/U04U 


ESTs 


2.00 


5.17 


/cnr<c 
4 DUO it) 






biphenyl hydrolase-JDce (serine bydrolas 


1.89 


2:46 


4010/0 


AlA/flT79fiQ 
AWOJ/ZOO 




Homo sapiens mRNA; cDNA DKFZp586M2022 {f 


1.70 


2.79 


452123 


AIOC7C1 C 

Al^o/bio 


u» oonoo 


ESTs 


1.69 


£.40 


muni 


WloUAJ 


U, 4 C7QQQ 

ns.lo/yoo 


neural cell adhesion molecule 1 


1.65 


A R7 
4.0/ 


437501 


AA761546 


Li, OAQOAA 
HS.Z40044 


ESTs, Weaidy similar to ALU1_HUMAN ALU 5 


1.50 


0.04 


402096 






ENSP00000217725*:Lamtnin alpha-1 chain p 


1.48 




403000 


AIU 10/00 


He \0 AM 


glyceronephosphate O-acyl transferase 


1.47 


3.22 


412810 


M21574 


Hs.74615 


platelet-derived growth factor receptor, 


1.46 






A WR1 9AB1 
flWD I 1401 


Ue iriAirv; 
rts.iU4iuo 


ESTs 


1.39 


2.89 


414U00 


A I 0707/17 

ALU/ 3/U/ 


u«» in7/ii^ 
ris.cU/440 


hypothetical protein MGC10848 


1.36 


2.80 


4000 f£ 


ai rtAftfiCQ 

ALU40003 


Ue 7/117 
HS.04U/ 


protein kinase (cAMP-dependent, cataryti 


1.35 


3.49 


4 IJOUO 




ns.oioo 1 1 


Homo sapiens cDNA FU23176 its, clone L 


1.34 


2.93 


^9/141*5 

4£V4|£ 


AVY3/00/4 


Uc 19S1/M 


ESTs 


1.32 


5.13 


4Z14U© 


ATI/ 303/ 


u« iniirw 
ns.iiwiuo 


Mais (mouse) homotog 2 


1.31 


4.07 


417446 


Al 4 4QC74 


ns.tKloo 


monoamine oxidase B 


1.27 


L.00 


4R9QOC 


AJ4/OIOU 


Ue *T7G1 
MS. 1 J/01 


ESTs 


1.26 


1.95 


446808 


AA/OoZcO 


Hs.16193 


Homo sapiens mRNA; cDNA DKFZp586B21 1 (fr 


1.25 


0.44 


443105 


X96753 


Hs,9004 


cbondrd tin sulfate proteoglycan 4 (mela 


1.24 


9/17 


421348 


M94048 


Hs. 103724 


peripheral myelin protein 22 


1.24 


C.00 


433070 


N75346 


Hs.306121 


CDC20 (ceil division cycle 20, S. cerevi 


1.23 


1 on 
£oU 


420059 


A inn 

AF161486 


Hs.94769 


RAB23, member RAS oncogene family 


1.22 


•J A1 
0.40 


408491 


AJ088063 


Hs.7882 


ESTs 


1.20 


O.U1 


447384 


AI377221 


HS.4052B 


ESTs 


1.00 


7 09 


421998 


R74441 


Hs.1 17176 


poly(A)-binding protein, nuclear 1 


1.00 


7.00 


409619 


AK001015 


hs.55220 


8CL2-associeted alhanogene 2 


1.00 


e A(\ 
D.4U 


444795 


AJ193355 


Hs. 160316 


ESTs 


l.UU 


C CO 

0.30 


408495 


UVTCQ70C 
WOO/30 


ms.zj (fit 


ESTs 


1.00 


c nc 
O.vO 


417124 


RP1997R9 
DC UX 




ESTs 


1.00 


4.73 


443998 


AI620661 


Hs.296276 


ESTs 


1.00 


4.39 


406303 






C16000922:gip499103lpirirT20903 hypothe 


1.00 


4.37 


422994 


AW891802 


Hs.296276 


ESTs 


1.00 


4.37 


422195 


AB007903 


Hs.113082 


KlAA0443gen8 product 


1.00 


4.35 


452877 


A1250789 


Hs.32478 


ESTs 


1.00 


3.90 


452487 


AW207659 


Hs.6630 


Homo sapiens cDMA FU 13329 fis, clone 0V 


1.00 


3.90 


417159 


R01761 




gb.-yeB1f10^1 Soares fetal Bver spleen 


1.00 


3.82 


445607 


AA488107 


Hs.30156 


ESTs, WeaWy similar to unnamed protein 


1.00 


162 


405274 






Target Exon 


1.00 


3.59 


410611 


AW954134 


Hs.20924 


K1AA1628 protein 


1.00 


106 


426495 


NMJJ01151 


Hs^043 


solute carrier family 25 (mitochondrial 


1.00 


2.89 


422292 


A1815733 


Hs.114360 


transforming growth factor beta-sMat 


1.00 


Z61 


413040 


AA193338 


Hs.12321 


sodium calcium exchanger 


1.00 


2.51 


429623 


NMJJ05308 


Hs.211569 


G protein-coupled receptor kinase 5 


1.00 


2.05 


456607 


AJ660190 


Hs.105070 


cyciin-dependent kinase Inhibitor 1 C (p5 


1.00 


2.01 


429143 


AA333327 


Hs.197335 


plasma gtutamate carboxy peptidase 


0.97 


2,45 


400288 


X06256 


Hs.149609 


Integrln, alpha 5 (fibronecfin receptor, 


0.90 


2.47 


442498 


U54617 


Hs.8364 


Homo sapiens pyruvate dehydrogenase kina 


0.88 


5.08 


414449 


AA557660 


Hs.76152 


decorin 


0.88 


ai3 


412014 


A1620650 


Hs.43761 


ESTs, WeaWy similar to A46010 X-Hnked 


0.78 


1.88 


425100 


AF051850 


Hs.154567 


supervBSn 


0.70 


3.90 


432094 


A1658580 


Hs.61426 


Homo sapiens mesenchymal stem cell prole 


0.68 


3.41 


42781B 


AW511222 


Hs.193765 


ESTs 


0.63 


3.75 
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CAT number. Gene duster number 
Accession: Genbank accession numbers 



Pkey CAT Number Accession 

417159 1653899 1 R01761 R01760 N49787 

419543 185745 T AA244170 AI018087AA244355 

450515 83710 1 AW304226AW008420AA349212H15015AA317021 AI829484 H25661 H81744A1906147AA837938AW16776SAW603578AW842359 

BE439926 AA902417 AW235409 AA010062 AW069319 AI280242 AW672925 H06848 H05608 R51905 R45023 AW675471 H28475 AI085597 
A119781 5 AI825355 N991 34 AI075956 A14701 22 AA449985 AW662833 AA860423 AA91 3342 T23825 AI394207 AI31 0319 T32467 A1589870 
AI682293 A1810633 BE223045 H14620 AA626645AA876023 T33571 AA953982 AI138631 H15Q16 A1304356 AA983B31 AI350990 All 43993 
AI708171 AA525961 H26247 W38485 AA847598 H81745 AW855485 BE299805 AI079409 A1278050 AI223168 AI860904 AW025415 AI339003 
AA393692 AI354302 A1492838 N80194 A101 5651 N34543 BE295397 AI085154 W24135 



TABLE 7C 

Pkey: Unique number corresponding to an Eos probese! 

Ret Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al." refers to the publication entitled "The DMA 

sequence of human chromosome IV Dunham I. et al. Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted, 
NLposta: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


NLposiiion 


402096 


8117697 


Minus 


24993-25186 


403010 


3132346 


Plus 


78385-79052 


404917 


7341851 


Plus 


49330-49498 


406274 


7543787 


Plus 


932-1123 


406303 


8575868 


Plus 


173622-173786 
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TABLE 8A: Genes predictive of bladder cancer progression 

Ptey: Unique Eos probeset identifier number 

ExAccru Exemplar Accession number, Genbank accession number 

UnigenelO. Unigene number 

Unigene Title: Unigene gene title 

R1 8Wh percentile of Ta or T1 tumor Als from patients who upstaged divided by 80th percentile of Ta or T1 tumor Als from patients who did not upstage 
R2 median of Ta or T1 tumor Als from patients who upstaged divided by the median of Ta or T1 tumor Als from patients who did not upstage 



Rcey 


ExAccn 


UnigenelD 


Unigene Title 


D4 
Kl 


D9 

Kit 


413324 


V00571 


Hs.75294 


corticotropin releasing hormone 


8.30 


/ 10 

4.IO 


437802 


AI475995 


1). J OAA4 n 

Hs. 122910 


ESTs 


7.51 


9 cn 


444444 


AJ149332 


Hs, 14855 


ESTs 


Z58 


1 

1.00 


445033 


AV652402 


u_ ■» inn 4 

Hs.72901 


mucin 13, epithelial transmembrane 




1 n 


417771 


AA804698 


Hs.82547 


retinojc acid receptor responder (tazaro 




C 9^ 
O.OJ 


449618 


AIQ76459 


Hs. 15978 


1/1 * A 4 (VIA 1 • 

kiaaizyz protein 


o 7n 


9 99 


artoai 


Mio/zo 




gbiHuman nonspecHic cross reacting antig 




1.90 


Aitma 
41oolo 


1 M7719 


nS.o4U/Z 


transmembrane 4 superfarrnly member 3 




1.00 


AOIAAi 

4ZJ441 


R68649 


U. 17Q^CO 

ns.Z/Wtiy 


absent in melanoma 1 Eke 


2.44 


Z40 


405033 






Pinn9fiC9*vii.CAA997 - cnli'Yl/i7QaiCTilfV. RARIT 


i.ro 


3.48 


423024 


A ACDQT11 

AAbyJ/ Jl 




COTe (.JUHarotohi similar In Al 1 IC UIIMAKl A 

to p s, Moaeraieiy similar to ALuo_nUiviAP. h 


J. it 


1.28 


4.2 E>1 io 


Al lfl7CG1 1 

Auurboii 


Ut- 1CV1R79 

nS.lo40f Z 


methylene tetrahydrofolate dehydrogenase 




Z78 


437928 


ML 1 AOCJ7C 

NM_GU547d 


Hs.5920 


UDP^ace^lgtucosamine-2-epimerase/TJ-8c 


9 on 


1 C9 
1.0 J 


446584 


U53445 


Hs. 15432 


down regulated in ovarian cancer 1 


1 "3(3 
1.00 


9 99 


>occna 






down syndrome critical region protein DS 




4.53 


404440 






NM_021 04&Homo sapiens melanoma antigen, 


9 PA 


1.00 


435602 


AF217515 


Hs.283532 


uncharacterized bone marrow protein 8M03 


o cm 


1 no 
Lira 


Art a nnn 

424098 


AF077374 


Hs. 139322 


small prdtrte-rich protein 3 


o Ay 
£4/ 


9 RA 
J.04 


437553 


AioZ&uo 


ns. 13043/ 


to is, weaKiy similar id mai o_mumajn wiluk 


9 no 


n Q1 


4ZoQ3o 


AtAmco on*. 

AVVudojUZ 




Homo sapiens hikna ior caioesmon, s u i k 


9 71 


4.00 


446839 


BEOS 1926 


•i 

Hs.16244 


mitotic spindle coiiec-cou related prot 


1 10 
J.10 


9 99 


4404/y 


UQCtf R 

riyoi Io 


LJ- HTQl 

ns./ 1 zyj 


uurHN-acieyiyiucosainine pynjpnospnorywo 


Z61 


1.81 


4izuoy 


A AH 700 

AAJi fvod 


nS.z4y/Zi 


CCTe KAnrlarilahi cimtlar In Pf'JlO'.Q £orri 

to I s, rwooeraiejy sirnuar to rwas rem 


1 on 


Z02 


401241 


a nivinrvon 




rnitogen-actrvaied protein kinase 8 inter 


1 OR 
I. ZD 


9 cc 

Z.00 


408741 


M73720 


Hs.646 


carboxy peptidase A3 (mast ceO} 


9 19 


1 99 


415989 


Al 267700 




ESTS 


1 cn 


1 nr. 


431070 


AW408164 


1 1 AJA4 fW 

Hs. 2491 84 


transenption factor 19 (SC1) 


1.73 


1 cn 

l.OU 


452140 


AB007928 


Hs.28169 


K1AA0459 protein 


Z44 


Z95 


443162 


T49951 


Hs.9029 


DKFZP434G032 protein 


2.98 


1.00 


435904 


AF261655 


Hs.8910 


1 ,2-alpha-mannosidase IC 


2.56 


1.20 


407379 


AA332127 


Hs.325804 


transcription factor 1 7 


Z10 


1.72 


442712 


BE465168 


Hs.1 31011 


ESTs 


Z54 


2.72 


411678 


AI907114 


Hs.71465 


squatene epoxidase 


1.12 


3.11 


406791 


A1220684 


Hs.347939 


hemoglobin, alpha 2 


1.69 


1.38 


431805 


NM_014053 


HSJ270594 


FLVCR protein 


1.92 


Z05 


438414 


AA806794 


Hs.131511 


ESTs 


1.04 


Z15 


413924 


AL1 19964 


Hs.75616 


seladin-1 


1.69 


Z05 


437679 


NM-014214 


Hs.5753 


irtositol{rnyo)-1(or 4)-monophosphatase 2 


2.27 


Z26 


445911 


A19B5987 


H3. 145645 


ESTs, Moderately similar to ALU1_HUMAN A 


1.42 


9 7 A 

Z.74 


408349 


BE546947 


Hs.44276 


homeo box C10 


. eft 
l.OU 


9 nc 


422545 


X02761 


Hs.287820 


fibronectin 1 


1.77 


3.02 


406643 


M77976 


Hs.347939 


hemoglobin, alpha 2 


1.57 


1.35 


407228 


M25079 


Hs. 155376 


hemoglobin, beta 


1.81 


1.50 






Hs.1 48324 


ESTs 


1.90 


3.19 


402305 






C19M0735 k .gi|4508027]ref|NP^0034M i 


2.25 


1.49 


427683 


BE545490 


Hs.15053 


Homo sapiens HCMOGT-1 mRNA for sperm ant 


1.08 


Z25 


441690 


R81733 


Hs.33106 


ESTs 


1.80 


Z65 


434487 


AF143867 


Hs.337588 


ESTs, Moderately similar to S65657 alpha 


Z13 


1.63 


403352 






KM.00161 S*:Homo sapiens actin, gamma 2, 


Z33 


Z22 


445496 


AB007860 


Hs.1 2802 


development and differentiation enhancin 


1.12 


Z60 


425234 


AW152225 


Hs.165909 


ESTs, Weakly simitar to I3S022 hypothec 


1.83 


aos 


402356 


AV648601 




apefipo protein B (including Ag(x) antige 


1.32 


2.05 


427254 


AL121523 


Hs.97774 


ESTs 


Z44 


1.00 


414533 


AA149060 


Hs.296100 


ESTs 


1.70 


Z06 


430157 


BE348706 


Hs.278543 


ESTs 


Z54 


3.00 


413433 


NM_003199 


Hs.32619B 


transcription factor 4 


2.26 


1.41 


410532 


T53088 


Hs.155376 


hemoglobin, beta 


1.52 


1.56 


405779 






NM_005367:Homo sapiens melanoma antigen, 


2.42 


1.22 


450455 


AL1 17424 


Hs.25035 


chloride intracellular channel 4 


Z48 


Z13 


414081 


AW969976 


Hs.279009 


matrix Gfa protein 


1.81 


1.53 


414761 


AU077228 


Hs.77256 


enhancer of zeste (Drosophila) homotog 2 


1.65 


Z08 


415062 


H45100 


Hs.49753 


uvea) auto antigen with coiled coil domai 


1.62 


3.75 


406317 






C20O2658*:gil5625694|gb.AAF19354.1tAF1B5 


1.68 


2.11 


453259 


R93125 


Hs.124167 


ESTs 


1.08 


Z25 


445937 


AI452943 


Hs.321231 


UDP-GatbetaGlcNAc beta 1 ,4- galactosyl! 


1.76 


1.01 


434370 


AF130988 


Hs-58346 


ectodysplasln 1, anhWrotic receptor 


1.12 


Z08 


418058 


AW161552 


Hs.83381 


guanine nucleotide binding protein 11 


Z02 


Z40 
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432331 


W37862 


HS.27436B 


MSTP032 protein 


4.36 


118 


451736 


AW080356 


Hs.23889 


ESTs, Weakly similar to ALU7_HUMAN ALU S 


1.51 


3.45 


413109 


AW389B45 


Hs.1 10855 


ESTs 


4.34 


5.98 


413643 


AA! 30987 


Hs.1 88727 


ESTs 


1.30 


142 


433217 


AB040914 


Hs.276628 


K1AA1481 protein 


1.70 


149 


435232 


NMJXM262 


Hs.4854 


cyclln-dependent kinase inhibitor 2C (p1 


1.79 


1.66 


438006 


BE1 48799 


Hs.1 27951 


hypothetical protein FU 14503 


1.76 


104 


410102 


AW248508 


Hs.279727 


ESTs; homdogue of PEM-3 [Ciona savignyi 


2.32 


2.45 


433656 


AW974941 


Hs.292385 


ESTs, Weakly similar to I78885 serine/th 


1.14 


150 


415714 


NM.002290 


Hs.78672 


lamirtin, alpha 4 


2.52 


1.13 


450282 


AA007655 


Hs.93523 


ESTs 


1.40 


2.58 


442855 


AI074465 


Hs.1 33469 


ESTs 


1.54 


2.20 


432917 


NMJJ14125 


Hs.241517 


PRO0327 protein 


124 


3.03 


429041 


AJ132820 


Hs 194768 


a disintegrin and melalloproteinase doma 


1.61 


160 


442807 


AL049274 


Hs8736 


Homo sapiens mRNA; cONA OKFZp564H203 |fr 


1.73 


1.19 


427719 


Al 393122 


Hs.1 34726 


ESTs 


1.46 


100 


408778 


AI500519 


Hs 63382 


hypothetical protein PR02714 


1.46 


158 


418870 


AF 147204 


Hs 89414 


chsmokine (C-X-C motif), receptor 4 (fus 


1.40 


103 


424425 


AB 031 480 


Hs.1 46824 


SPR1 protein 


1.60 


1.11 


445391 


T92576 


Hs.1 91 168 


ESTs 


1.69 


140 


446899 


NMJHJ5397 


Hs.1 6426 


podocatyxfn-tike 


1.22 


142 


420996 


AK001927 


Hs.1 00895 


hypothetical protein FU) 10462 


168 


2.98 


424909 


S78187 


Hs.1 53752 


eel division cycle 25B 


2.18 


1.33 


413593 


AA205248 




gbzq78c11r1 Stratagene hNT neuron (937 


1.69 


100 


408734 


AW264996 


Hs.254299 


ESTs 


106 


194 


413880 


AI660842 


Hs.1 10915 


interleuktn 22 receptor 


1.24 


120 


437063 


AA351109 


Hs.5437 


Taxi (human T-cell leukemia virus type 1 


1.78 


160 


418044 


AI640532 


Hs.1 19830 


ESTs, WeaHy similar to ALUF_HUMAN !!!! 


1.54 


153 


441971 


W27060 


Hs.265855 


ESTs 


1.62 


113 


450401 


AW959281 


Hs.8184 


ESTs 


1.42 


130 


440157 


AA868350 


Hs.343636 


ESTs 


1.38 


160 


457587 


AA992841 


Hs.27263 


KIAA1 458 protein 


1.47 


142 


440707 


BE256751 


Hs.22667 


Homo sapiens cDNA: FU22073 fis, clone H 


1.18 


110 


402241 






Target Exon 


158 


152 


413428 


AA430155 


Hs.1 51343 


KIM1524 protein 


1.22 


145 


416735 


R11275 


Hs.194485 


ESTs 


1.14 


114 


421562 


AI910275 




trefoil factor 1 (breast cancer, estroge 


1.25 


1.03 


431031 


AA830335 


Hs.1 05273 


ESTs 


135 


195 


433336 


AF017986 


Hs.31386 


secreted frizzled-related protein 2 


172 


1.00 


420786 


AW296466 


Hs 43628 


deleted in lymphocytic leukemia, 2 


1.23 


160 


401335 






Target Exon 


1.18 


1.68 


417670 


R07785 




gbryfl 5c06j1 Soares fetal liver spleen 


1.56 


100 


406314 






C14O0102O:gi|1 259744 1Jgb|AAG60049.1|AF31 


1.60 


3.08 


458981 


AW968318 


Hs.285996 


hypothetical protein FU23375 


1.70 


150 


417509 


AA203414 


Hs 42009 


ESTs 


1.82 


105 


452732 


BE30007B 


Hs.60449 


Homo sapiens, clone IMAGE:3535294, mRNA, 


1.34 


1.37 


418678 


MM 001327 


Hs 87225 


cancer/teslis antio&n (NY-ESO-1) 


1.37 


1.02 


457871 


All 68278 




ESTs 


1.20 


119 


4441 R3 






nh , nr l >dn07 x1 SfiArftS nfcjfwita 8to9waeks 


1.43 


1.22 


413276 


224725 


no. < vluu 


mHnoen inducible 2 


1.78 


128 


421097 


AI280112 


Hs 125232 


Homo sapiens cDNA FU 13266 fis, ciona OV 


155 


160 


417151 


AA1 94055 


Hs 293858 


ESTs 


1.68 


1.67 


453556 


AA425414 


Hs 33287 


mrjpar factor 1VB 


106 


140 


440859 


AW070865 


Hs.346390 


ESTs 


1.12 


1.70 


420629 


AW204343 


Hs.1 56823 


ESTs, Weakly similar to T30868 RhoA-bind 


1.21 


138 


422363 


T55979 


Hs.1 15474 


replication factor C (activator 1) 3 (38 


1.58 


115 


434831 


AA24ffl)60 


Hs. 273397 


KIAA0710 gene product 


1.69 


1.78 


412055 


AAOQ9907 


Hs. 771808 


ESTs, Weakly similar to ALU1_HUMAN ALU S 


1.36 


165 


445468 


AW450439 




ESTs 


1.52 


150 


444550 


BE250716 


Hs. 87614 


ESTs 


1.30 


128 


417259 


AW9D3838 


Hs 81800 


chortdroitin sulfate proteoglycan 2 (vers 


1.50 


3.02 




AWTfi7Q02 


Ha 096443 


Homo sapiens mRNA; cDNA DKFZp564N1063 (f 


1.24 


195 


413444 


BE141019 




gb:^HT0067-201 039-002*10 HT0067 Homo 


"1.68 


180 


433844 


AA610175 


Hs.179647 


Homo sapiens cDNA FU 12195 lis, done MA 


1.49 


101 


427055 


AI301740 


Hs.173381 


dihydropyrimldinase-iike 2 


1.11 


158 


454244 


R51604 


Hs.300842 


K1AA1608 protein 


1.00 


102 


429503 


AA394183 


Hs.26873 


ESTs 


158 


4.08 


422940 


BE077458 




gb:RC1-BT060649050(W)15-b04 BT0606 Homo 


3.48 


146 


407949 


W21874 


Hs.247057 


ESTs. Weakly similar to 2109260A B cell 


3.13 


1.48 


437312 


AA809350 


Hs.246160 


ESTs 


1.10 


105 


449426 


T92251 


Hs.198882 


ESTs 


1.22 


108 


447620 


AW290951 


Hs.224965 


ESTs 


1.80 


118 


444700 


NM.003645 


Hs.1 1729 


fetty-acjd-Coenzyme A ligase, very long- 


1.72 


181 


436258 


AW887491 


Hs.107125 


plasmalemma vesicle associated protein 


1.99 


1.80 


415712 


AW249188 


Hs.169577 


Homo sapiens cDNA FU 14743 fis, done NT 


1.66 


198 


432877 


AW974111 


Hs.292477 


ESTs 


1.63 


179 


412085 


AW891667 




gb:CM3.OT0089-110500-179-h09 NT0089 Homo 


1.40 


108 



175 
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431421 


AW969118 


Hs.108144 


ESTs, Weakly similar to unnamed protein 


1.96 


3.58 


409714 


AW367812 


Hs.199961 


ESTs, Weakly similar to ALU7_HUMAN ALU S 


1.24 


Z10 


423013 


AW875443 


Hs.22209 


secreted modular cakaum-Wnding protein 


1.49 


Z09 


422663 


AW500087 


Hs.119014 


zinc finger protein 175 


1.01 


Z78 


439737 


AI751438 


Hs.41271 


Homo sapiens mRNA fid length Insert cDN 


1.54 


Z57 


413196 


AA127386 




gb:zn90d09.r1 Stratagene lung carcinoma 


1.04 


Z18 


439349 


AJ560898 


Hs.195602 


ESTs 


203 


Z43 


443005 


AIQ271B4 


Hs.200918 


ESTs 


1.42 


2.10 


424762 


AL1 19442 


Hs.183684 


eukaryotic translation initiation factor 


2.58 


3.43 


427373 


AB007972 


Hs.1 30760 


myosin phosphatase, target subunil 2 


208 


1.70 


413916 


N49813 


Hs.75615 


apolipoprotein C-ll 


0.70 


0.42 


416332 


R34976 


Hs.78293 


ESTs 


274 


Z43 


426552 


BE297660 


Hs.170328 


moesin 


1.28 


1.52. 


456583 


AF179897 


Hs.104105 


Meis (mouse) homolog 2 


1.42 


2.02 


447214 


A1367288 


Hs.273621 


Homo sapiens cDNA: FU21 350 fis. done C 


1.14 


Z10 


449254 


W26908 


Hs.172762 


ESTs 


2.04 


Z50 


443651 


W22152 


Hs.282929 


ESTs 


2.94 


3.08 


421021 


AA808018 


Hs.109302 


ESTs 


1.41 


Z04 


419741 


NMJXJ7019 


Hs.93002 


ubiquilin carrier protein E2-C 


1.61 


1.21 


432027 


AL095678 


Hs.272353 


WAA0957 protein 


1.70 


Z78 


452668 


AA721140 




ESTs, Weakly similar to putative p150 (H 


1.80 


295 


417042 


C75563 


Hs.113029 


nbosomal protein S25 


2.22 


3.20 


443574 


U83993 


Hs.321709 


purinergic receptor P2X, ligand-gated io 


1.21 


2.51 


429372 


AA451859 


Hs.99253 


ESTs 


1.28 


Z43 


424290 


AA338396 




gb:EST43386 Fetal brain 1 Homo sapiens c 


1.82 


203 


42851B 


AW969656 




gb:EST38l733 MAGE resequences, MAGK Homo 


1.72 


Z52 


455649 


BE065051 




gb:RC1-8T0313-HO5(XW)17-c04 BT0313 Homo 


1.65 


103 


414665 


AA160873 


Hs.332053 


serum amyloid A1 


1.36 


1.08 


418298 


AA256014 


Hs.86682 


Homo sapiens cDMA: FU21578 fis, done C 


1.04 


Z03 


429655 


U48959 


Hs.211582 


myosin, light polypeptide kinase 


4.94 


4.34 


433924 


M618304 


Hs.258785 


ESTs 


1.44 


Z40 


452683 


AI089575 


Hs.9071 


progesterone membrane binding protein 


1.48 


Z48 


439437 


A1207788 


Hs.343628 


staryltransferase 4B (beta-galactostdase 


1.36 


Z33 


432314 


AA533447 


Hs.312989 


ESTs 


0.96 


Z78 


400881 






NM_025080:Homo sapiens hypothetical pro! 


1.70 


3.15 


426477 


AA379464 




gb:EST92386 Skin tumor 1 Homo sapiens cO 


201 


Z37 


454741 


BE154396 




gb:CM2-KT0342-091 299-0504)05 HT0342 Homo 


Z12 


3.44 


423977 


AA333232 




gb:EST37283 Embryo, 8 week I Homo sapien 


1.38 


Z13 


450396 


AU077002 


Hs.24950 


regulator of G-prolein signalling 5 


Z13 


3,28 


431642 


KM 005764 


Hs.271473 


epithelial protein up-regulated in card 


1.90 


Z23 


415157 


D63257 




gb:HUM514B08B Oontech human placenta po 


1.29 


Z90 


418236 


AW994005 


Hs.337534 


ESTs 


1.74 


Z37 


454390 


AB020713 


Hs.56966 


WAA0906 protein 


1.47 


1.38 


436143 


AA705245 


Hs.192189 


ESTs 


1.46 


Z45 


436251 


BE515065 


Hs.296585 


nucleolar protein (KKE/D repeat) 


1.43 


Z07 


450735 


AI732321 




SRY (sex determining region Y)-box 4 


1.36 


Z02 


420136 


AW801090 


Hs.195851 


actin, alpha 2. smooth muscle, aorta 


Z70 


1.68 


447100 


AJ361801 


Hs.167130 


hypothetical protein 


1.66 


Z03 


453577 


AL043049 




gb:DKFZp434A1523_r1 434 (synonym: htes3) 


1,41 


Z75 


408522 


AI541214 


Hs.46320 


Small protine-rich protein SPRK [human, 


1.59 


1.38 


412632 


All 20379 


Hs.74294 


aldehyde dehydrogenase 7 family, member 


1.81 


Z51 


423291 


NMJJ04129 


Hs.126590 


guanyiate cyclase 1, soluble, beta 2 


1.54 


Z83 


456172 


R99050 




gb:yq65c02j1 Soares fetal liver spleen 


1.46 


Z68 


452123 


AI267615 


Hs.38022 


ESTs 


1.24 


1.93 


433900 


AA721668 


Hs.257761 


ESTs 


1.78 


Z66 


408436 


R31954 


Hs.7885 


phosphatidylinositd binding dathrin as 


1.21 


Z35 


417123 


BE326521 


Hs.159450 


ESTs 


1.46 


Z09 


436023 


T81819 


Hs.302251 


ESTs 


Z68 


Z70 


454150 


AA131893 


Hs.154088 


hypothetical protein FU22756 


1.40 


Z50 


444094 


AJ695764 


Hs.202394 


ESTs 


1.28 


4.03 


429176 


AW975021 


Hs.193800 


ESTs 


1.08 


253 


422259 


AA307584 




gb:EST1 78498 Colon carcinoma (HCC) cell 


1.80 


103 


451164 


AA015912 


Hs.60659 


ESTs, Weakly similar to T46471 hypotheB 


1.47 


1.17 


417501 


AL041219 


Hs.82222 


sema domain, immunoglobufin domain (Ig), 
gb:QV^BT0296-14020(W)85.h01 BT0296Homo 


Z92 


Z70 


455642 


BE063965 




1.70 


270 


443387 


BE139135 


Hs^54629 


ESTs 


1.48 


232 


420995 


AA282495 


Hs.89014 


ESTs 


1.45 


1.51 


407329 


AA576061 


Hs.269834 


ESTs. Weakly similar to ALUD.HUMAN !!!! 


1.13 


238 


438797 


C16161 


Hs.283040 


hypothetical protein PR02543 


0.99 


275 


443357 


AW016773 




low molecular mass ubiqulnone-blnding pr 


1.60 


208 


412656 


AF005011 


Hs.74375 


dishevelled 1 (homologous to Drosophila 


1.32 


1.13 


427377 


AU077029 


Hs.177543 


antigen identified by monoclonal anCbod 


1.24 


0.79 


412200 


R08110 


Hs.187462 


ESTs, Weakly similar to 138022 hypotheU 


1.35 


1.54 


432586 


AA568548 




ESTs 


1.50 


225 


411590 


T96183 




gb:ye09f07.s1 Stratagene lung (937210) H 


1.22 


253 


422672 


X12784 


Hs.119129 


collagen, type IV, alpha 1 


Z27 


220 


420256 


U84722 


Hs.76206 


cadherln 5, type 2, VE-catflierfn (vascula 


1.59 


211 
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419300 


AJ469960 


Hs.170698 


ESTs 


1.30 


2.68 


410805 


AW804742 


Hs.84264 


acidic protein rich in leucines 


1.16 


2.28 


452560 


BE0770B4 


Hs. 99969 


ESTs 


1.44 


2.58 


448429 


D17408 


Hs.21223 


caJponin 1, basic, smooth muscle 


4.02 


1.00 


424436 


AW818428 


Hs.4953 


oolni autoanSaen oolain subfamily a. 3 


1.10 


2.00 


447400 


AK0flfl392 


Hs 18457 


hvooihaticai orotfiin FU20315 


1.45 


1.28 


422522 


Alft9149fl 




ESTs Hkihtv similar to S94541 1 done 4 


1.82 


2.10 


443696 


ttiVOU/444 


Hs 1 ldfi9? 


ESTs 


1.98 


2.01 


436094 


r\l / 30 f V I 


Hs 999997 


ESTs 


1*34 


2.40 


420168 


AP917WR 
ttr^l f QUO 


rra.33034 




1.58 


2.45 


■wow 


APflflARR9 
r\TUU400£ 


H<: 91Q3RR 
nb. £jyooo 


euniavin KinHInn nmttun 1 


1.34 


Z43 


439022 


AM0OO033 


no. 1 f 031/4 


ESTs 


2.76 


2.40 




AA97A397 


He 13R917 
na. i ooio/ 


pqTr Mnrlprafelv similar Jrt Y14fl HI IMAM H 
Co is, MOucJaiciy sinuuji m i i4u_nuiviMi , i n 


1.78 


2.65 




MM/ 030 DO 


ns. j touoo 




1.50 


2.40 


420689 


H7QQ7Q 


Un RRR73 

ns.oDOro 


ESTs 


1.26 


2.28 


ft O 300 


TU3/O0 


He 997fl«; 


gdiiuncKunirioouijtic *;iu ^wvonj ^ i ecejJiu 


1.72 


1.29 


439943 


AW0fl37R9 


Hs 194690 


ESTs 


145 


Z84 


4440W 


/wooyuo 


He 19Q1RR 
n5. l£3 100 


PQTe 
CO IS 


1.24 


2.35 


443014 


AtQRQAOn 

ttjyc343U 


He 1Q77fn 
n5. 13/ »IH 


CO 1 S 


1.12 


2.22 


444363 


AMi9R97 


na. i hjo ju 


ESTs 


1*32 


2.08 


424479 


APflfidTlfl 
r\T \J04Z00 


(L3. 143U30 


em/vtthalSn 
omuulliclul 


1.59 


1.10 


437321 




Hs 293098 


peje VteaWv similar to 2109280A B cell 


1.28 


2.07 


431926 


AW972724 




gb:EST384816 MAGE resequences, MAGL Homo 


1.52 


Z63 


433540 


AWW0125 

aJU iw 


Hs 240443 


Hnmn s^nlpns cDNA: FU23538 fis done L 


1.69 


1.30 


415901 


H08396 


Hs.76118 


ubiquitln carboxyMerrnina] esterase L1 


1.71 


2.19 


4371 99 


Al 110175 

fM» 1 IU 1 1 J 


Hs 306337 


Homo saotens mRWA* cDHA DKFZd564H061 6 (i 


1.48 


2.50 


457450 


AW9Q4163 


Hs.146127 


ESTs 


1.07 


2.60 


456678 


AF141305 


Hs.173736 


ancient ubiquitous protein 1 


1.44 


Z35 


451079 


rviut 1 3LKJ 


Hs 240728 


ESTs Moderately similar to PC4259 ferri 


0.95 


3.00 


405944 






Tampt Pvnn 


1.48 


2.45 


408877 


A A 470/1.17 


Hs 130315 


PSTs Waakiv similar to A47582 B-ce!l or 


1.38 


2.20 


446682 


AW205632 


Hs.211198 


ESTs 


1.38 


3.00 


431380 


AWfiirr?fi9 


Hs 9Q1A03 
na,tj iuvj 


ESTs 


1.43 


2.64 


442027 




Hs. 128395 


ESTs 


1.18 


2.43 


423578 


AW960454 


Hs.222830 


ESTs 


1.56 


2.18 


441495 


AW294603 


Hs 127039 


ESTs 


2.80 


1.73 


417900 


RP7-V11 97 


Hs 89906 


PrV N 9fl /mil rRvisinn cvda 20 S rprnvi 


1.36 


1.18 


443949 


AWft9741Q 


Rs 235070 


ESTs 


130 


2.28 


AAtiAQZ 
44U430 


AAM7919 


He 141R1 


hvrvrthptiral nrrrfpm nKP7rvd1dl1Q^n 

nypouicuCoi proieiii ui\rL^y*<yi\ 1 3ou 


1.74 


2.78 


449948 


R19156 


Hs 207QR 


ESTs 


1.12 


2.23 




W77Q11 
VY( f 31 1 


He 1in00R 


COTr 
CO 15 


1.34 


2.85 


4zozzo 


HAO0ZDU4 


He 19R3RQ 


Thu 1 roll eitrfarA anflnpfl 
1 fly | Cell auiidbet dllUyty I 


1.24 


1.09 


4OO103 


AA7RCT7RR 
MA/ 00/00 


He 19fWTK 
ns. i c U30O 


CO IS 


1.30 


2.10 


qooyoo 


MJ 1 / 400 1 


He lOnRT^ 
n5. i ouoij 


ESTs 


1.14 


2.15 


■Uf 131 


NIYMJUOD4D 


He 'MIW 


carina nmloiKO inhihitnr KqtsI h/np R 
ocnilo piUlCKuc UilHUilUI, issxtxM lypc, o 


1.21 


2.18 


41 lDOZ 


mWoDG030 




nh-r*/L1.rTn97R_1Q1 nQ0_A9AJin PTn97R Hnrm 


1 flR 
1.00 


1.94 


HIV 1 31 


AA/oSlOJ 


nS.ooooU 


CO IS 


1 RR 
l.OD 


2.71 


wy^y i 


A\ATT7*W79 
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457096 


A1809202 


Hs.208343 


426123 


AA370352 




449445 


AW197349 


Hs.232197 


430683 


AC004862 


Hs.247768 


440642 


AI744995 




455236 


AW875972 




449622 


AW013915 


Hs.196578 


415116 


AA 160363 


Hs^69956 


457269 


A1338993 


Hs.134535 


427877 


AW138725 


Hs.178057 


454631 


AW811324 




458390 


AI792585 


Hs.1 33272 


435844 


M700856 


Hs.59651 


427237 


AA399964 


Hs.97763 


408855 


T83061 


Hs.319946 


442151 


AI733404 


Hs.1 28865 


412708 


R26830 


Hs.1 06137 


417262 


AA195276 


Hs.263858 


419362 


N64116 


Hs.24624 



phospholysine phosphohisfidlne inorganic 
ESTs 

gb:PM1-DT0054-231299-002-cO2 DT0054 Homo 
ESTs 

ESTs, Weakly similar to 138022 hypolheti 
gb:hm55g10.x1 NC1_CGAP_RDF1 Homo sapiens 
ESTs 

nudaar receptor suMamSy 1, group I, m 
gb:EST378554 MAGE resequences, MAGI Homo 
ESTs 

spficing factor 3b, subunit 2, 145kD 
NM_Q22775:Homo sapiens hypothetical prot 
ESTs 

regulator of differentiation (In S. pomb 
ESTs 

KIAA0870 protein 

gb:EST391359 MAGE resequences, MAGP Homo 
DNA2 {DMA replication helfcase, yeast h 
NM_016232*:Homo sapiens interieukin 1 re 
C800144r:gi|8923061|ref|NP^060114.1| hy 
Homo sapiens done FLB3442 PRO0872 mRNA, 
ESTs 

Human DNA sequence from done RP3-457L1 
gb:MR4-ST0062-24030W)03-g05 ST0062 Homo 
ESTs 

gbiy77d05j(1 NCJ_CGAP_Kid1 1 Homo sapien 
ESTs, Weakly similar to 138022 hypolheti 
ESTs, Weakly sircar to S65657 a!pha-1C- 
ESTs 
ESTs 

ESTs, Weakly similar to 210926QA B cell 
heat shock 90kO protein 1, alpha 
myosin, heavy polypeptide 11, smooth mus 
adaptor-related protein complex 3, mu 2 
Target Exon 

ESTs. Weakly similar to S22765 heterogen 
C200295Z-gl|1345964|spjP10O79|FBP1_STRPU 
ankylosis, progressive (mouse) homolog 
ESTs, Weakly similar to KBF3.HUMAN NUCLE 
ESTs 

gb:Human pTR2 mRNA for repetitive sequen 

fidgetin 

ESTs 

hypothetical protein FU 10477 

Target Exon 

ESTs 

gb:tr34b07j(1 Na_CGAP_Ov23 Homo sapiens 
NM w 002240*:Homo sapiens potassium inward 
ESTs, Weakly skvSar to 138022 hypolheti 
ESTs 

C3001164*:gi|1730196|sp|P50573|GAR3_RAT 
ESTs 

WAA0143 protein 

Homo sapiens cDNA FU11812 fis, clone HE 

hypothetical protein RJ 10948 

ESTs 

Target Exon 

gb:CM4-OI0045-010200-514-frj3 CN0045 Homo 
ESTs, Weakly similar to cereb reside sulf 
gb:EST82246 Prostate gland I Homo sapien 
ESTs 

Homo sapiens PAC done RP4-697H1 7 from 7 

ESTs, Moderately similar to ALU4JWMAN A 

gb:CM3-PT0014471299451-W5 PT0014Homo 

ESTs 

ESTs 

ESTs 

ESTs 

gML33T0141-131099-017-A02ST0141 Homo 
ESTs, Weakty similar to AlUC.HUMAN HI! 
ESTs, Weakty simitar to 178885 serine/th 
ESTs 

Homo sapiens mRNA for KIAA1727 protein, 
ESTs 

ESTs, Weakty similar to CGHU7L collagen 
ESTs, Moderately similar to B34087 hypot 
hypothefcaJ protein FU21 945 
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447248 AW295831 Hs.6496 ESTs 

415622 F13010 Hs.12400 ESTs 

414065 AW515373 Hs.271249 Homo sapiens cONA FU 13580 fis, done PL 

414585 W46954 Hs.334716 hypothetical protein MGC16291 

5 443197 Z43613 gb:HSC1GD091 normalized infant brain cDN 

428266 AI382001 Hs.43590 ESTs 

447083 A1472124 Hs.157757 ESTs 

412302 AW936334 gb:GV4-DT002 1-281 299^)70^05 DT0021 Homo 

445555 AW974013 ESTs 

10 453117 AW162044 Hs.104203 hypothetical protein MGC12981 

436757 AW975663 Hs.293404 ESTs, Weakly similar to ALU LHUMAN ALUS 

431976 AA719001 Hs.291065 ESTs 

430657 AA482910 Hs.279664 ESTs 

438744 BE314727 Hs.75721 profiltnl 

15 439325 AF086139 Hs.150423 cycfoHJependent kinase 9 (CDC2-related 

438117 AA328041 Hs.194329 hypothetical protein FU21 174 

401686 NM_014587*:Homo sapiens SRY (sex determi 

420269 U72937 Hs.96264 alpha thalassemta/mental retardation syn 

434288 AW1B9075 Hs.116265 fibrM 

20 433215 AB040912 Hs.191098 hypothetical protein FL) 1 1 598 

413429 BE139117 Hs.278881 ESTs 

426417 AA377908 Hs. 13254 ESTs 

413882 AA132973 Hs.184492 ESTs 

413346 AA128586 gb;zl24h06.r1 Soara_pregnanOrterus_NbH 

25 445020 AI205655 Hs.147221 ESTs 

418175 AW967054 Hs.206312 ESTs, Weakly similar to 138022 hypothefi 

429582 A1569068 Hs^2247 ESTs 

409134 AW340389 Hs.250585 ESTs 

415642 U19878 Hs.336224 transmembrane protein with EGF-Eke and 

30 435667 F13625 Hs.124183 ESTs 

440513 BE407106 Hs.65907 Homo sapiens, clone !MAGE:3959816, mRNA, 

419711 C02621 Hs.159282 ESTs 

434249 AA987537 Hs.129875 ESTs 

437355 AL359557 Hs.306508 Homo sapiens mRNA; cDMA DKFZp7620l415 (f 

35 428360 H10291 HsJ0974 ESTs 

435339 A1358300 Hs.129827 ESTs 

435345 AW360966 Hs.6653 ESTs 

435105 A1878982 Hs.131859 Homo sapiens F-box protein FBX11 mRNA, p 

459645 AA074346 Hs.250715 ESTs 

40 449691 M002143 Hs.21413 solute carrier family 12. (potasshimchl 

425955 T96509 Hs.248549 ESTs, Moderately similar to S65657 alpha 

437272 AW975957 gb:EST388066 MAGE resequences, MAGN Homo 

456955 NM-006925 Hs.1 66975 spTicing factor, arginlne/serfne-rlch 5 

421362 AK000050 Hs.103853 hypothetical protein FU20043 

45 457926 AA452378 Hs.11637 Homo sapiens mRNA; cONA DKFZp547Jl25 (fr 

444557 AI167637 Hs.146924 ESTs 

434476 AW858520 Hs.84264 acidic protein rich in leucines 

458059 AW015588 Hs.137232 ESTs, Weakly similar to S65657 alpha-1C- 

413595 AW235215 Hs.16145 ESTs 

50 417281 R98773 rte^68883 ESTs 

445689 BE158869 gb:<WHT0398-21010(M>96-fl>8 HT0398 Homo 

423249 AA3236B2 Hs.125374 ESTs, Weaxty similar to S26689 hypotheti 

408366 AW511255 Hs.346442 ESTs 

441359 AI435179 Hs.126B20 ESTs 

55 413068 BE063792 gb:GV3^T0295-2OT10XMI6S<J06BT0295Komo 

441322 AW071851 Hs.130628 ESTs 

409124 AW292809 Hs.50727 N-acetylglucosaminidase, alpha- (Sanfili 

432413 AK000257 Hs.274505 Homo sapiens mRNA; cDNA DKFZp564A21 6 (fr 

425391 AI248252 Hs.160672 ESTs 

60 443861 AW449462 Hs.134743 ESTs 

454609 AW810204 gb:MR4-$T012M21199417408ST0125Homo 

425893 AA629695 gb:ad43b07.s1 Stratagene lung carcinoma 

443611 NM.014397 Hs.9625 N1MA (never In mitosis gene a}-related k 

410359 R38624 Hs.106313 ESTs 

65 406308 NM_025l9£Homo sapiens hypothefica! prot 

432476 T94344 Hs.326263 ESTs 

435073 AA664078 gb:ac04a05.s1 Stratagene lung (937210) H 

420581 AA278459 Hs.151940 ESTs 

435579 AJ332373 Hs.156924 ESTs 

70 439633 AF086464 Hs.86248 ESTs 

430551 AA481150 Hs.136343 ESTs 

450855 T979B8 Hs.295605 mannosidase, alpha, class 2A, member 2 

444326 A1939357 Hs.270710 ESTs 

412149 R49355 Hs.273824 ESTs 

75 455116 AW857271 gb:CM0-CTO307-210l00-15B^09 CT0307 Homo 

449626 AA774247 Hs.301637 zinc finger protein 258 

410047 AI167810 Hs.132390 zinc finger protein 36 (KOX 18) 
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418865 
402762 
436449 
403488 
431235 
448576 
408100 
433436 
422337 
426160 
447008 
420141 
423840 
447793 
407328 
432451 
421311 
444649 
448688 
428847 
413750 
429355 
427798 
431179 
451719 
438094 
418504 
407414 
416410 
439141 
441181 
434482 
455757 
425787 
405727 
441846 
451945 
438432 
451140 
407341 
453041 
437613 
451507 
430259 
453669 
455065 
442220 
437936 
442556 
405223 
437225 
421101 
436200 
402025 
407019 
451305 
423450 
423139 
451763 
458915 
452829 
446383 
432576 
433820 
419719 
415868 
420738 
446614 
404167 
417074 
401215 
421600 
426248 
454523 
420656 
402833 
438910 



AW117500 

AI418027 

AA318271 

AB026730 

AW205382 

AW162474 

R38572 

AA206020 

BE010189 

AA702961 

AA332434 

A1424924 

AA508857 

AW972771 

N71848 

AW207523 

R94570 

A1954833 

BE161453 

AW973253 

AA412499 

AI338644 

A1373532 

A1821755 

BE159718 

AF072164 

H53777 

A1241470 

AA416925 

AF143331 

BE079531 

AA363867 

AW850980 

BE504055 

AW444990 

AW411354 

AA918886 

AI680737 

R19892 

AW291109 

BE550182 

AL049029 

AW854352 

AL037800 

AW798475 

AL137751 

AW975982 
AFO 10446 
R51385 

U49973 

AW003571 

AJ290445 

AW402725 

AW294647 

AI915689 

AI955579 

T05816 

AW157424 

A1401627 

AA844700 

H06728 

NM.0041B5 

AK001733 

Z49B78 



Hs.104241 ESTs 



T18988 

AW803980 

AA279098 

AA827921 



Hs.120361 

Hs.250905 
Hs.21495 
Hs.42676 



Hs.167460 

Hs.124103 
Hs.72465 
Hs.211203 
Hs.187748 
Hs.292471 
Hs.283609 
Hs.197628 
Hs.266869 
Hs.98881 

Hs.292689 
Hs.104779 
Hs.1 95432 
Hs.157910 
Hs.131805 
Hs.85335 

Hs.36822 
Hs.268982 
Hs.121076 
Hs.16073 

Hs.1 55029 



Hs.211420 

Hs.258800 

Hs.26002 

Hs^04918 

Hs.289068 

Hs.10267 

Hs.208767 

Hs.1 27826 

Hs.7258 

HsJ148 

Hs.288549 

Hs.8379 

Hs.292935 
Hs.101840 
Hs.124881 



Hs.211191 

Hs.128759 

Hs.288560 

Hs.233634 

Hs.212781 

Hs.63368 

Hs.92511 

Hs.165954 

Hs.174067 

Hs.39297 

Hs.21017 

Hs.258575 

Hs.15562 

Hs.81131 

Hs.323231 
Hs.293668 

Hs.1 87636 

Hs.291858 



ENSP00000235171*:6AP junction beta-4 pro 
ESTs 

ENSP00000201948:KARYOPHER!N BETA2B HOMOL 
hypothetical protein 

UDP-Gal:belaGlcNAc beta 1,3-ga!actosyltr 
KIAA0781 protein 

Bruno (Orosophita} -like 6, RNA binding 
gbyc87c11.s1 Soares infant brain ir>f£B H 
epfictng factor, arginine/serine-rich 3 
nuclear receptor subfamily 1, group i, m 
ESTs, Weakly similar to 138344 titin, ca 
ESTs, Weakly similar to non-lens beta ga 
ESTs 

ESTs. Weakly similar to ALU1_HUMAN ALU S 
ESTs, Weakly similar to ALU1.HUMAN ALU S 
hypothefical protein PRO2032 
ESTs 

ESTs, Weakly similar to ALU 1.HUMAN ALU S 
ESTs 

gb:IL2-HT0437-290200-045-A06 HT0437 Homo 

ESTs 

ESTs 

aldehyde dehydrogenase 2 family (mitocho 
ESTs 

ESTs, WeaMy similar to A56194 Ihrombaxa 
Homo sapiens mRNA; cDNA DKF2p564D1462 (f 
gb:Homo sapiens HSFE-1 mRNA, parfiat cds 
ESTs 
ESTs 

peptidylprotyl isomerase {cydophilin}-! 
ESTs 

gb:RC5-BT0624-240300mD08 BT0624 Homo 
ESTs 

CX001244:gi|11420428|ref|XP_004814.1| be 
gML3-CT0220-15020W)66-B03 CT0220 Homo 
ESTs 

ESTs, WeaWy similar to 138022 hypolhefi 
UM domain binding 1 

ESTs, Weakly similar to ALU8_HUMAN ALU S 
Homo sapiens cDNA FU 1 191 8 fis, done HE 
MIL1 protein 

ESTs, Weakly similar to T3161 1 hypotheti 

RalGEF-fike protein 3, mouse homolog 

hypothefical protein FU 22021 

gb:RC3-CT0255-2Q010(H)24-g10 CT0255 Homo 

selenoprotein T 

hypothetfcal protein FU14710 

Homo sapiens mRNA; cDNA DKFZp586L2424 (f 

Target Exon 

ESTs 

major histocompatibility complex, class 
ESTs 

NM_021624:Homo sapiens hislamine H4 rece 
gb:Human Tiggerl transposable element, c 
ESTs, WeaWy similar to A46010 X-linked 
KIAA0524 protein 
hypothefical protein FU21106 
hypothetical protein FU 14220 
EST 

ESTs, WeaWy similar to TRHY_HUMAN TRICH 
ESTs 

ESTs. WeaWy similar to 138022 hypotheti 
ESTs 

ESTs, Moderately similar to ALU1.HUMAN A 
ESTs 

wingless4ype MMTV integration site fami 
hypothefical protein FU10871 
NM_021956*:Homo sapiens giutamate recept 



C12000457*:gi|7512178|pirlIT30337 polypr 
Homo sapiens cDNA FU11946 fis, clone HE 
ESTs 

gb:P^M0O84-2403QO-O0l-G11 UM0084 Homo 
ESTs 

C1002508:gi|6691937|emb|CAB65797.1| (AL0 
ESTs, WeaWy similar to ALUC.HUMAN ll!l 
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416170 


H42454 


Hs.220645 


ESTs 


433596 


AI762836 


Hs.271433 


ESTs, Moderately similar ta ALU2_HUMAN A 


417699 


T91491 


Hs.1 19670 


ESTs 


459805 


AL045773 




gb:DKFZp434F246 r1 434 (synonym: htes3) 


453204 


R10799 


Hs.1 91990 


ESTs 


458971 


AL119206 


Hs.1 26257 


ESTs, Weakly similar to ALU1_HUMAN ALU S 


457040 


N77624 


Hs 173717 


phosphatidic acid phosphatase type 2B 


400414 


AFnR.11 18 

nlwJ 1 IO 


Hs 283968 


Homo sapiens CATX-2 mRNA, complete cds 


426263 


AI908774 


Hs 259785 


carnitine petmiioyttransferase i r Bver 


439334 


AI14AQ7R 

Ml 1*409/ O 


He 112062 


ESTs 


455527 


AW984479 




gb:PM1-HN0012-2203004K)1-b12 HN0012 Homo 


408084 


Al 040832 


Hs 160422 


Homo sapiens done PP902 unknown mRNA 


432059 


AF227131 


Hs.272387 


taste recsrrtor hme 2 member 4 


429791 




Hs 119427 


ESTs 


438695 


AI885190 

niOw low 


Hs.1 56069 


ESTs, Weakly similar to repressor protei 


458139 




Hk 251147 


ESTs' 


413035 


DC IJJJUJ 




gb:PM4-HT0352-1 71 199-001-005 HT0352 Homo 


422444 


AA^mfififl 




gb:EST181501 JurkatT-cellsVHomosapie 




AW4iniQn 

txliH tv I9v 


u» oenfi2d 

n5i£0UO£H 


hvoolhetica! oratdn MGC4473 


411432 


AW846272 




gb:QVO-CT0179-300999-024-d12 CT0179 Homo 


445327 


Al 220082 


Hs. 147722 


ESTs 


424628 


AB011136 


Hs.151385 


KIAA0564 protein 


440197 


AW340708 


Hs.317714 


pallid (mouse) homolog, paflkfin 


409894 


RF08.1711 




gb:GV2-BTO635-22040O-158-e04 BT0635Homo 


422776 


AA316987 


Hs.1 29846 


ESTs 


428255 


AJR27478 


Hs 187670 


ESTs 


412484 


AA1 12090 


Hs.269961 


ESTs 


432789 


D25361 


Hs.3104 


K1AA0042 gene product 


430100 


AA766178 


Hs.291601 


ESTs, Highly similar to T00350 hypotheti 


419528 


AA244000 


Hs.222365 


ESTs 


441793 


AA968459 


Hs.1 58785 


ESTs 


429468 


AF033579 




T-box10 


410248 


AA1 66653 


Hs.268171 


ESTs 


401818 






NM 000664* :Homo sapiens acetyl-Coenzyme 


451724 


A1903765 




gb:UI-BT037-301 298-1 02 BT037 Homo sapien 


431866 


NMJ)12098 


Hs.8025 


angiopoiefin-like 2 


432719 


AW91«yJ11 


Hs 314460 


ESTs 


418977 


AA211flQd 

nr%£0 0U9*r 


H* 1Q1517 
ns. 1 9 1 0 1 1 


ESTs 








C6000989*:gi|7573285|emb|CAB87644.1| (AL 




□cc^qqnc 

□£049900 


Lie 211 75A 
r».£0 1 1 u*t 


ESTs 


453823 


AL1 37967 




gb:DKFZp761D2315_r1 761 (synonym: hamy2) 




AA302741 


no . Cs) 1 00 


ESTs, Moderately similar to JC5238 galac 


ANflfiA 
4UU7U4 






Tarns! F*on 


4U01U4 








miuuo 


AWR1 171ft 




obMR3-ST0191-O20200-207-d04 ST0191 Homo 


4100OI 


AA1ft17Q7 




CCTq 
CO 19 


40UOQO 


AI734179 


Hs 105676 
no. iwjuf u 


ESTs 


432420 


Al (M46SQ 


Hs.43791 


ESTs 


4M1Q7 
4UO I0» 






C2002793**aiH353148lsDlQ095o3IYR86 CAEE 




t\ruju IUJO 




nh'7rfl1f 1 7 rl Sfratanpna MT2 npi imnrf nr 


414990 


AW/4f OUU 


no. 03V 1 0 


hvnnlhpfrrsl nrfrfeln FU21935 


4U1U10 






FW9POnnnn99712fiMAAI AHASF II PROTEIN 


40*W0 


W\004 10** 


He 25Q4IM 


ESTs 


409O00 


BE244127 




ab-TCBAP1 F0661 Pediatric ore-B cell acut 


HOI 1 t-l 


AW2Q9Q47 


Hs.1 22872 


ESTs, Weakly similar to JU0033 hypoiheti 
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4Z0f 1* 


A A41 ?^4fl 


Hs 21421 


ESTs Moderat^vsim3arb>ALU1 HUMANA 


mi 090 


AA4nfi057 


Hs.97998 


ESTs 


41 £300 




Hs-344069 


gb:EST02500 Fetal brain, Stratagene (cat 


A22/U1 

HlLUHO 


All 33649 


Hs 110953 


rptinotc stsd induced 1 


A21R14 

4tl0i*» 


L12350 


Hs.108623 
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AA1 30992 




gb:zo15e0Zs1 Stratagene colon (937204) 


435563 


AF210317 


Hs.95497 


solute carrier family 2 (facilitated g!u 


452396 


H10302 


Hs.112577 


ESTs 


440612 


BE561384 




gb:601344969F1 NIH.MGCJ Homo sapiens cD 


454721 


AW815588 




gb:QVT>ST0216M)6129^066-a09 ST0216 Homo 


417796 


AA206141 


Hs.6786 


ESTs 


432864 


D16217 


Hs.279607 


calpastatin 


454480 


AA088375 


Hs.22612 


hypothetical protein DKFZp566D1346 
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AF143870 


Hs.15246 


ESTs 


418797 


AA515814 




gb:ng64b03.s1 NCl_CGAP_lip2 Homo sapiens 
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441283 


AA927670 


Hs.131704 


ESTs 
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AW290871 


Hs.129121 


ESTs 
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AL037357 
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tropomodutin 2 (neuronal) 
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AA363171 




gb:EST72986 Ovary 11 Homo sapiens cDNA 5 
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432250 


AA452088 


Hs.274170 


Opa-interacllng protein 2 


1.26 


2.71 


431911 


AK000156 


Hs.272193 


Homo sapiens cDNA FU20149 lis, clone CO 


1.46 


2.60 


413923 


AI733852 


Hs.199957 


ESTs 


1.62 


2.10 


449590 


AA694070 


Hs.268835 


ESTs 


1.20 


2.53 


438467 


AA808027 


Hs.123277 


ESTs 


1.48 


2.10 


432121 


A1824879 


Hs.211286 


ESTs, Weakly similar to 1207289A reverse 


1.27 


3.13 


412298 


AW9363D0 




gb:QV4-DT0021-281 299-070^04 OT0021 Homo 


1.42 


160 


408519 


AA679082 


Hs.43481 


hypothetical protein OKFZp564K192 


1.84 


3.70 


416067 


T79732 


Hs.1 4633 


ESTs 


1.11 


3.08 


420497 


AW2062B5 


Hs.25354B 


ESTs 


1.90 


2.48 


405704 






NM_001844*:Homo sapiens collagen, type I 


1.42 


2.90 


423443 


AI432601 


Hs.168812 


Homo sapiens cDNA FU14132 fis, clone MA 


1.42 


2.03 


415904 


Z44679 


Hs.336391 


ESTs 


1.62 


194 


413786 


AW613780 


Hs.13500 


ESTs 


0.33 


0.17 


404031 






C5001700*:gij9256616|retJNP_061761.1| pr 


1.94 
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457412 


N40711 


Hs 333300 


hypothetical protein FU 14026 


1.92 


3.20 


439719 


AF086554 


Hs.326048 


Homo sapiens mRNA; cDNA OKFZp434M0420 (f 


1.62 


130 


418161 


AI950754 


Hs.81716 


ESTs 


1.81 


142 


425894 


AW954011 


Hs.1 807 11 


ESTs 


0.92 


2.20 


41S988 


W39388 


Hs.55336 


Homo sapiens, done MGC: 17421, mRNA, com 


1.34 


157 


439668 


AI091277 


Hs.302634 


frizzled (DrosopriBa) homotog 8 


1.67 


166 


450177 


AI698091 


Hs.107845 


ESTs 


1.50 


125 


459704 


AA719572 


Hs.274441 


Homo sapiens mRNA; cDNA DKFZp434N01 1 (fr 


1.27 


3,35 


410357 


AW663614 




gb:h]22e04jc1 NCt_CGAP_U8 Homo sapiens 


0.69 


0.59 


459234 


AI940425 




gb:CM0-CT0052-150799-024-c04 CT0052 Homo 


1.67 


108 


421313 


NM_014923 


Hs.1 03329 


KIAA0970 protein 


0.57 


0.26 


431322 


AW970522 




gb:EST382704 MAGE resequences, MAGK Homo 


1.80 


173 


423086 


AB028984 


Hs.1 23420 


KIAA1061 protein 


0.40 


0.56 


425980 


AA366951 




gb:EST77963 Pancreas tumor III Homo sapt 


1.33 
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423185 


BE299590 


Hs.1 25078 


ornithine decarboxylase antizyme 1 


0.60 


0.56 


410840 


AW806924 




gb:QV4-ST0023-t 60400-1 72-h1 0 ST0023 Homo 


1.50 
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403917 






Target Exon 


1.82 
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AJ674710 


Hs.174397 


ESTs 
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AW439340 


Hs.1 89720 


ESTs 
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443318 
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Hs.133141 


ESTs 


1.45 
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441093 


AI698138 


Hs.1 26918 


ESTs 


1.40 


135 


439432 


A1984203 


Hs.57874 


ESTs 


0.88 
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454629 


AW811114 




gb:MR2-STO13M11199-016-a04 ST0131 Homo 


1.96 


131 


406207 






Target Exon 
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444872 


A1936264 




p30 DBC protein 
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1.15 


128 


404730 






Target Exon 


1.84 


178 
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AI732230 


Hs.1 91737 


ESTs 


1.49 


155 


448471 


AA158517 


Hs.21276 


collagen, type IV, alpha 3 (Goodpasture 


0.37 


0.36 


438978 


A! 095207 


Hs 307972 


ESTs 


1.57 


139 


418786 


Af?9S317 


Hs.203594 


Homo sapiens uncharactenzed gastric pro 


2.86 


134 


400416 


AF083130 




Homo sapiens GATX-1 4 mRNA, partial cds 


103 


1.55 


450446 


AI696334 


Hs.1 4450 


ESTs 


1.32 


138 


419791 




Hs 105104 


ESTs 


0.41 


0.27 


449436 


AA860329 


Hs 279307 


hypothetical protein DKFZp434!21 17 


2.01 


1.50 


430808 


L08603 


Hs.247980 


melanocorBn 4 receptor 


1.09 


118 


443116 


Al 033397 


Hs 137975 


ESTs 


1.30 


125 


437923 


BE088433 


Hs.334696 


hypothetical protein K1AA1335 


1.40 


2.50 


403294 






Target Exon 


0.98 


118 


436007 


AI247716 


Hs 232168 


ESTs 


1.38 


1.00 


430649 


AB040941 


Hs.247713 


KIAA1508 protein 


1.52 


185 


437271 


AL1 37445 


Hs 28846 


Homo sapiens mRNA; cDNA DKFZp5660134 (fr 


1.29 


159 


444500 


AV551273 


Hs.282966 


ESTs, Moderately similar to 2109260A B c 


1.22 


105 


447434 


R16890 


Hs 137135 


ESTs 


1.72 


185 


400830 






NMJ)25006:Homo sapiens hypothetical prot 


2.04 


168 


4281 14 


A1821548 

raw i *j*T\j 


Hs.98363 


ESTs, Weakly stm8ar to I38022 hypotheti 


1.09 


174 


409688 


AI1504S5 




gb:qf35a10jc1 Soares_testis_NHT Homo sap 


1.67 


1.38 


440781 


BE561823 


Hs.281434 


Homo sapiens cDNA RJ 14028 6s, done HE 


1.28 


150 


442662 


U78168 


Hs.8578 


Rap1 guanfne-nudeotide-exchange factor 


1.92 


128 


443078 


M78728 


Hs.132694 


Homo sapiens cDNA FU23149 hs, done I 


1.42 


103 


440179 


AI990151 


Hs.125904 


ESTs 


1.49 


163 


446780 


R31107 




gb:yh61g01-s1 Soares placenta Nb2HP Homo 


1.96 


178 


444173 


A1126432 


Hs.149493 


ESTs 


1.50 


110 


417939 


R53863 


Hs.337512 


ESTs, WeaWy similar to ALUB_HUMAN 111! 


1.60 


103 


428490 


BE301738 


Hs.49806 


ESTs, Weakly similar to A4601 0 X-Jinked 


0.47 


0.44 


443869 


AI141520 


Hs.151464 


ESTs, Weakly similar to ALUCJWMAN 111! 


1.25 


168 


426322 


J05068 


Hs.2012 


transcobalamin 1 (vitamin B12 binding pr 


2.12 


1.15 


411630 


042349 


Hs.71119 


Putative prostate cancer tumor suppresso 


0.64 


0.48 


454701 


AW854930 




#:PMO-CT0263-201099-003-I06 CT0263 Homo 


1.30 


133 


439795 


N77294 


Hs.194294 


ESTs 


1.17 


133 


425546 


BE409762 


Hs.26118 


hypothetical protein MGC13033 


1.17 


185 


411245 


AW833441 
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1.90 


3.98 
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434957 AF283775 Hs.35380 x 001 protein 

425724 AA362525 gb:EST72223 Namalwa B ceils i Homo sapie 

446847 T51454 Hs.82845 Homo sapiens cDNA: FU21930 fis, clone H 

453216 AL137566 Hs.32405 Homo sapiens mRNA; cDNA DKFZp5B6G0321 (f 

421718 AL1 17574 Homo sapiens mRNA; cDNA DKFZp434L2221 (f 

415924 H18047 Hs.335821 ESTs 

450850 AA648886 Hs.151999 ESTs 

443153 AI371823 Hs.34079 ESTs 

434420 AA688278 Hs.194864 hypothetical protein FU22578 

426126 AL118747 Hs.26691 ESTs 

421926 AA300591 gb:EST13437 Testis tumor Homo sapiens cD 

459563 AI590487 Hs.49760 gb:tt77d04jc1 NCLCGAPJ-1SC3 Homo sapiens 

453006 AI362575 Hs.303171 ESTs 

437223 C15105 Hs.330716 Homo sapiens cDNA FU 14368 fis, clone HE 

417016 AA837098 Hs.269933 ESTs 

420223 N27807 Hs.286 ribosomal protein L4 

425303 AA354785 gb:EST63098 Jurkat T-ceOs V Homo sapien 

400375 NM.014115 NM_014115*:Homo sapiens PRO0113 protein 

456169 Y07909 Hs.79368 epithefia! membrane protein 1 

409707 AA861773 Hs.313501 ESTs 

422241 Y00062 Hs.170121 protein tyrosine phosphatase, receptor t 

443152 AI803470 Hs.204529 KIAA1 806 protein 

452714 AW770994 Hs.30340 hypothetical protein WAA1 165 

415110 H04043 gb:yj45c03.rl Scares placenta Nb2HP Homo 

443251 BE185436 Hs.278839 ESTs 

433441 R37094 Hs.13742 ESTs 

434612 R76513 Hs.301183 molecule possessing ankyrin repeats Indu 

417807 R17806 Hs.269452 gb:ygQ9b06.r1 Soares infant brain 1NIB H 

426902 AI125334 Hs.97408 ESTs 

436028 AA731124 Hs.120931 ESTs 

428878 AA436884 Hs.48926 ESTs 

439749 AU89942 Hs.1 57752 Homo sapiens mRNA fuQ length insert cDN 

442435 AI986208 Hs.244760 ESTs, Highly similar to B34087 hypotheti 

416527 T62507 Hs,11038 ESTs 

441808 AW118601 Hs.127887 ESTs, Moderately similar to 1 609 195B bio 

417054 AF017060 Hs.174151 aldehyde oxidase 1 

446636 AC002563 Hs.1 5767 citron (rbo-interacting, serbe/threonin 

418442 AI873471 Hs.186898 ESTs 

416640 BE262478 Hs.79404 neuron-specific protein 

403146 Target Exon 

457397 AW969025 Hs.109154 ESTs 

439189 AI951185 Hs.144630 nuclear receptor subfamily 2, group F,m 

423969 AI830571 Hs.34969 hypotheUcat protein DKFZp566N034 

459683 AI674906 Hs.199460 gb:wc73f02.x1 NCLCGAP_Pan1 Homo sapiens 

426826 AK001890 Hs.1 72554 guanine nucleotide binding protein beta 

414462 BE622743 Hs.301064 arfapBn 1 

438027 N93047 Hs.19131 transcription factor Dp-2 (E2F dimerizat 

408623 AW811978 Hs.254037 ESTs 

433765 AA909619 Hs.1 12668 ESTs 

417132 N56605 Hs.269053 ESTs 

416815 U41514 Hs.80120 UDP4^tyWj)ha.f>gaiactosamtne:poiyp 

435186 AL119470 Hs.145631 ESTs 

411107 AW958042 Hs.95870 PTD01 5 protein 

406930 U04691 gb:Human otfactory receptor (OR1 7-21 9) g 

411026 AW8137B6 gb:RC3^T0197-12020(W)15.b05 ST0197Homo 

415766 H01613 Hs.50628 adaptor-related protein complex 4, sigma 

446018 AW531111 Hs.249727 gb:hh92e12.y1 NQ_CGAP_GU1 Homo sapiens 

440125 AW238410 Hs.253888 ESTs 

449832 AA694264 Hs.60049 ESTs 

431899 AA521381 Hs.187726 ESTs 

431531 BE142052 Hs.62654 kringle-contalning transmembrane protein 

441077 AI241273 Hs.15312 ESTs 

426799 H14843 Hs.303154 popeye protein 3 

419480 BE536584 Hs.1 22546 hypothetical protein FU2301 7 

455908 BE156306 gb:QV0-HT0^-150200-114^04HT0367rtemo 

403332 Target Exon 

455753 BE075124 gb:PM1-BT0585.1102Q0403-h02 BT0585 Homo 

404429 Target Exon 

438941 AF075047 Hs.31864 ESTs 

428745 AA433896 Hs.201534 ESTs 

411567 AW851630 gb:MW^0222.2110994)02.h06CT0222Homo 

458714 R20916 Hs.344777 ESTs 

426839 M74782 Hs.172689 interteukin 3 receptor, alpha (low affin 

444539 AI955765 Hs.146907 ESTs, Weakly similar to 2004399A chromos 

407322 AA171B92 Hs.324570 ESTs, Weakly similar to ALU3_HUMAN ALU S 

453826 AL138129 gb:DKFZp547F1 52j\ 547 (synonym: hfbrl) 

435695 AA694324 Hs.257675 ESTs 
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Target Exon 


1.80 


3.08 


417759 


R13567 


Hs.1 2548 


ESTs 


1.63 


Z58 


417527 


AA203524 




gb:zx55e10.r1 Soares_fetaiJrver_5pteen_ 


1.52 


2.02 


427526 


AA405062 


Hs.345830 


gb:zu12e04.r1 Soares JestisJJHT Homo sap 
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1.90 


4553(H) 


AW891707 




gb:CM3-NT0090^40500-17l"-e02 MT0090 Homo 


1.12 


Z20 


448121 


AL045714 


Hs.1 28553 


hypothetical protein DKFZp564F013 


0.93 
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gb:wp07e04.x1 NCLCGAPJOd 12 Homo sapfen 


1.43 


Z08 
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gb:yv76f05.s1 Soares fetal Over spleen 


1.61 


Z80 


441056 


H37860 


Hs.1 25720 


ESTs 


1.11 


Z10 


400311 


AF072164 


Hs 137570 


Homo sapiens HSFE-1 mRNA, partial cds 


Z04 


2.95 


451478 


mm 012331 


Hs.25458 


methionine sulfoxide reductase A 
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425288 


AA354502 




gb:EST62799 Jurkat T-ceDs V Homo sapien 
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Hs 150580 


PTD010 protein 
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rVJUOO 130 


Hs 22968 


Hnmn ^aniens rlnnn IMARF*451Q39 mRNA «;p 
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Hs. 13477 


ESTs, Weakly similar to 1207289A reverse 
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0.57 


0.48 
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Hs 126623 


ESTs 


2.02 


1.61 


445026 


W90337 


Hs. 2829 66 


ESTs, Moderately similar to 210926QA B c 
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2.23 


414182 


AA136301 


Hs.344442 


KIAA1 105 protein 


1.32 


2.55 


457048 


AA400352 


Hs.1 12861 


ESTs 


1.54 


2.05 


440542 


AA889143 


Hs.295655 


ESTs, Weakly similar to PC4259 ferritin 


1.48 


Z15 


422857 


R71461 




gb.7i51h07.r1 Soares placenta Nb2HP Homo 


1.42 


Z78 


445948 


AW444562 


Hs.202247 


ESTs 


1.50 


2.48 


454002 


BE299567 


Hs.271749 


ESTs, Moderately simOar to ALU8.HUMAN A 


1.31 


2.25 


413656 


T91703 




gb:ye20g09.s1 Stratagene lung (937210) H 


Z10 


1.69 


420441 


A1986160 


Hs. 160383 


dual specificity phosphatase 6 


0.99 


2.33 


412062 


H09124 


Hs.202341 


Homo sapiens cDNA: FU 23573 fis, done L 


2.14 


1.61 


408991 


BE501816 


Hs.281927 


ESTs 


1.76 


Z83 


432534 


AW361626 


Hs.339833 


hypothetical protein FU 1 1 240 


0.41 


0.28 


435136 


R27299 


Hs.10172 


ESTs 


0.76 


3.40 


451052 


AA281504 


Hs.24444 


Homo sapiens cDNA: FU22165 fis, clone H 


Z16 


1.85 


413928 


AA442498 
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AW857541 AW814172 H66214 AW814398 AF134164 AA243093 AA1 73345 AA1 99942 AA223384 AA227092 AA227080 T12379 AA092174 
T61139M149776M699829AVra79188AW813567AViffl13538AI267168M157718M157719M10M7^ 

AA157730 AA157715AA053524AW849581 AW854566 C05254 AW882836 T92637 AW812621 AA206583AA209204 BE1 56909 AA226824 

AI829309 AW991 957 N66951 AA527374 H66215 AA045564 AI694265 H60808 AA149726 AW195620 BE081333 BE073424 AW817662 

AW817705AW817703AW817659BE081531 H59570 

AW852235AF156166 R07008 

AA664078 AW363313 AA805009 

AA628980AI126603BE504035 

BE149154BE149151X60138 

AW975957 AA747943 AA81 1289 

A1082424 A1740586 AA771764 AA771B06 AI033879 BE500996 AW204531 D3B676 
W05391 X56197 H75313 AW770789 AA281642 AV654440 AA344646 R00244 T83378 

AW373062 T55662 A1299190 BE174210 AW579001 H0181 1 W401 86 R671 00 AI923886 AW952164 AA628440 AW898607 AW898616 
AA7091 26 AWB98S28 AW898544 AA947932 AW898625 AW898622 AI276125 AJ1 85720 AW510698 AA987230 T52522 BE467708 AW243400 
AW043642 AI288245 A1186932 D52654 D55017 D52715 D52477 053933 064679 AI298739 AI1 46984 AI922204 N98343 BE174213 AA845571 
AI813854 AI214518 AI635262 AI139455 A1707807 AI698085 AW884528 AI02476B AI004723 AW087420 AI565133 N94964 AI268939 
AW51 3280 A1061 1 26 AI43581 8 A1659106 A1360506 AI024767 AA513019 AA757598 X56196 AA902959 AI334784 AI860794 AA01 0207 
AW890091 AW513771 A1951391 A1337671 T52499 AA890205 A! 640908 H75966AA463487 AA358688A1961767AI866295 AA780994 
A1985913 BE174196 AAO29094 AW592159 T555B1 N79072 AI611201 AA910812AI220713 AW149306 AI75B412 AA045713 R79750 K76096 
L09078 L03145 L09094 L09098 103165 L09102 
M27346 

AF086410 W94386 W74609 
AW979249 063277 AA846968 
BE561384AW732707 

AJ744995 A1973099 R19291 AA935215 AA897534 AI69B070 BE179102 AI694912 
AW850980 AA969513 AW366793 

AI033188 BE004743 AW804074 BE004795 BE178560 BE089438 BE089378 BE089439 BE089437 BE089423 BE08942Q BE089425 BE089424 



Z43613 AA320191 AW954963 H15889 R83872 

AW01 6773 AI052778 AM52937 AW085293 

A!126098AI184746AI148521 

Z17870AL039867A1139927 

R44007 N54521 A1140476AW418849 T95308 

AI936264 AI219228 A1200740 BE30077 1 AI635739 A1690626 AW04401 0 R60842 R40388 
AW450439 AW297340 AI239849 AI61 31 1 9 
AW97401 3 AA557257 N36812 AI597998 AA873565 
BE158869 AI248241 BE1 58865 BE 158868 
R31107 A1341136 A1653198 H04953 

BE010189AW879041 BE008038 AW905325 AA343575 AW844209 AL038020 BE010133 BEO 10083 A W9 03901 BE174377AI309717AI349651 

AI903726 BE176661 AW905394 

AJ523875 R45782 R45781 

T94709AI820675A1732253 

R55428 AI820704 AI732283 R54983 

AI732321 R55640 R55639 AI820744 

AI903765 AI811194 BE007147 AW130760 

BE066552BE066341 A1907683 

AL043049 T95976 

AL137967BE064160BE064186 

AL138129AL138179BE064231 

AW753456 AW753036 AW854868 AW854862 

AW803980AW803974 

AW810204 AW810555 AW810196 AW81 081 9 AW810507 

AW81 1 1 14 AW81 1095 AW81 1087 AW81 1 1 24 AW81 1054 AW81 1094 AW81 1 1 57 

AW81 1324 AW81 1325 AW81 1325 AW81 1333 AW81 1329 AW81 1328 AW81 1332 AW81 1 339 AW81 1335 

AW81 1633 AW81 1652 AW81 1898 

AW81 2715 AW81 2646 AW81 2714 

AW854930 AW814431 AW814190 AW814115 AWB54941 

AVW15588AW815671AVV815493AW815898AW815613AVrai5490AVV815808AW8154M 
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454741 
454826 

454915 
455065 
455087 
455116 
455236 
455300 
455407 
455508 
455527 
455642 
455649 
455753 
455757 
455908 
455917 

455988 
456034 

456172 
456332 
456381 
457297 
457389 
457567 
457581 
457741 
457871 
457951 
459234 

TABLE 8C 



1232559 1 BE154396 AW817959 BE154393 

1235377~1 AW833676 AW833814AW833798AW833677 AW833449AW833630AW833626AW833444 AW833366AW833791 AW833659AW833432 

AW333534 AW833S56 AW833553 

1242343 1 AW841619 AW851958 AW851851 AW851985 

1251980~1 AW854352 AW854311 AW854340AW854461 

1252832J AW855389 AW855555 AW855420 

1254206J AW857271 AW857308 AW857296 AW857258 

1265662 1 AW875972 AW875983 AW875974 AW876000 AW875966 AW876050 

1 275482~1 AW891707 AW891696 AW891917 AW891 913 AW891912 AW891 909 AW89 1890 AW891892 AW891889 AW891 697 AW891880 AW891898 

1288347 J AW936813 AW936731 AW936728AW936600 AW936681 AW936651 

1318507 1 AW976165 C04000 

1 3221 25~1 AW984479 AW984498 AW984495 AW984477 AW984480 AW9B4504 

1348163~1 BE063965 BE063968 BE064034 BE064028 BE063874 BE063966 BE053869 BE064043 BE064033 BE063884 

1348708 1 BE065051 BE155165 BE064764 BE155231 BE064648 BE064671 BE064636 

1356070 1 BE075124 BE075229 BE075278 

1358657 J BE079531 BE079371 BE079372 8E079593 BE079468 BE079504 BE079505 

13B2301J BE156306 BE156188 BE156298 BE156377 BE156374 

1382784.1 BE156765 BE156770 BE156767 BE156769 BE156803 BE156802 BE156847 BE156853 BE155780 BE156836 BE156792 BE156B34 BE156779 

BE156789 BE156833 BE156844 BE156831 BE156849 BE156797 BE156784 BE156801 BE156843 BE156793 BE156852 

1397740 1 BE177983BE178322 

142696.T AW450979 AA136653 AA136656 AW419381 AA9B4358 AA492073 BE168945 AA809054 AW238038 BE011212 BE011359 BE011367 

BE011368 BE011362 BE011215 BE011365 BE01 1363 

1603643 1 R99050 R99067 H67642 

179104 T AA228357 AW841786 AW841716 

1B4123 1 AA236606 AA459341 AA237079 

313764 1 AW9681B8 AA468196 AA468269 AA468298 

331080 1 AW970989 AA502167 AA507546 _ 

357346*1 AW939074 AW939073 BE1 60476 AW939938 AW939206 AW940012 AW939076 AA573577 AW750479 AA574383 AW970057 

359936~1 AA578512 AA595535 BE177533 

395767~1 BE044740 AW827360 AW827623 BE161439 BE044718 BE046207 BE046551 AA653908 BE166581 

426637~1 AI168278 AA858238 BE550792 A1522194 A1B19707 AA973538 AI990086 AI628424 A1095270 AI991608 AA730741 

44251.1 U23860 U80739 

945240.-1 A1940425 



Pkey: Unique number corresponding to an Eos probeset 

Ref. Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. 'Dunham I. et al refers to the publication enfitled Tlie DNA 

sequence of human chromosome 22." Dunham I. et aL Nature (1999) 402:48&495. 
Strand: Indicates DNA strand from which exons were predicted. 
NLposition: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


NLposition 


400704 


8116864 


Minus 


6311063241 


400830 


8570385 


Plus 


157683-163035 


400834 


8705192 


Rus 


121963-122288 


400840 


9188586 


Plus 


113882-114121 


400850 


1927150 


Minus 


4506-4691 


400881 


2842777 


Minus 


91446-91603.92123-92265 


401016 


8117441 


Rus 


126234-126359,128050-128236 


401090 


8492704 


Minus 


201281-201460 


401215 


9858408 


Plus 


103739-103919 


401241 


4827300 


Minus 


3050W0844.31056-31248 


401335 


9884861 


Plus 


15736-16352 


401381 


8570226 


Minus 


118629-119146,119392-119657 


401400 


7708226 


Minus 


33028-33585 


401469 


6682292 


Minus 


125521-125639 


401473 


7249001 


Plus 


115142-117305 


401577 


9280797 


Minus 


139377-139674,141195-141281,142217-142340 


401658 


9100864 


Plus 


89638-90028 


401659 


7689875 


Minus 


183379-183521 


401686 


6468551 


Plus 


5005-5426,6810-7042 


401723 


7656694 


Rus 


147273-147503 


401814 


7408052 


Plus 


136003-136726 


401818 


7467933 


Minus 


10964-11084,11674-11817 


401890 


8516144 


Rus 


148955-149396,149569-150002 


401908 


8698760 


Minus 


126888-127024 


401913 


9369520 


Minus 


33753-33904 


401927 


3873185 


Minus 


112000-112137 


402025 
402039 


7547159 


Rus 


173835-173998 


7770432 


Rus 


560-1294 


402049 


8072512 


Rus 


100065-100419 


402085 


7249154 


Rus 


90533-90687,94949-95158 


402241 


7690131 


Minus 


125073-125206,130996-131125 


402294 


2282012 


Minus 


2575-3000 


402305 


7328724 


Rus 


40832-41362 
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9454515 


Plus 


195808*1 96863 






QDCC7QO 


Minus 


o/o4o-37&33 






8076B79 


Plus 


44058*44803 




402665 


8318556 


Plus 


58962-59294 


402762 


9230904 


Minus 


123298-124035 




402797 


3421043 


Minus 


15758-15930 




402833 


8918545 


Plus 


26987-27778 




402901 


8894222 


Minus 


175426-175667 


i n 


402948 


9368458 


Minus 


1 43456-143626, 1 43808-1 43935 


403066 


B954202 


Plus 


158189-158433 




403072 


8954241 


Plus 


141829-142005 




403146 


9739812 


Plus 


162877-1631 18 




403197 


9930749 


Plus 


79990-80237 


i ^ 
i j 


403214 


7630945 


Minus 


76723-77027,79317-79484 


403217 


7630969 


Phis 


54089-54163,55427-55623 




403290 


8083176 


Plus 


19288-20076 




403291 


7230870 


Plus 


95177-95435 




403294 


8098496 


Plus 


41565-41881 


on 


403315 


8247953 


Minus 


125117-125287 


403332 


8568139 


Minus 


31409-31674 




403344 


8569726 


Plus 


70823-70990 




403362 


8571772 


Plus 


64099-64260 




403371 


9087278 


Plus 


105655-106050 


L J 


403391 


9438337 


Plus 


42410-42544.83317-83540,86840-86922,87970-88110 


403488 


9966615 


Minus 


12450-12753 




403536 


8076924 


Pius 


34972-35182 




403779 


8018040 


Minus 


95602-95989 




403B59 


7708954 


Ptus 


11373B-1 13858 




403871 


7709262 


Plus 


104545-104757 


403903 


7710671 


Minus 


101165-102597 




403917 


7710849 


Ptus 


1 097 1 8-1 09847,1 09927- 1 10202 




403978 


8576014 


Plus 


97326-97808 




404031 


7671252 


Plus 


171477-172316 


J J 


404167 


9926594 


Minus 


77030-77280 


404220 


6706820 


Plus 


46107-46439 




404286 


2326514 


Plus 


51086-51301 




404418 


7382420 


Minus 


153339-153481,155099-155294 




404427 


7407959 


Plus 


127170-127358 


Ad 


404429 


7407979 


Plus 


31352-31498 


404440 


7528051 


Plus 


80430-81561 




404495 


8151634 


Minus 


59449-60477 




404580 


6539738 


Minus 


240588-241589 




404606 


9212936 


Minus 


22310-23269 


/I C 

4!) 


404730 


8389582 


Plus 


119832-120016,124110-124275 


404917 


7341851 


Plus 


49330-49498 




405033 


7107731 


Minus 


142358-142546 




405137 


8570507 


Ptus 


158969-159423 




405146 


9438278 


Minus 


102529-102633 


50 


405158 


9966252 


Plus 


4287343056,43815-43949 


405187 


7229825 


Pius 


1 17025-1 1 7170,1 18567-1 18736 




405223 


7239614 


Plus 


106184-106313 




405340 


6094635 


Ptus 


49644-49760 




405494 


8050952 


Minus 


70284-70518 


c c 


405551 


1552506 


Pius 


12525-12997 


405654 


4895155 


Minus 


53824-53759 




405667 


4726099 


Plus 


5798-5914 




405673 


4589984 


Plus 


50700-50842 




405704 


4204244 


Plus 


138842-139051 


oU 


405723 


9801668 


Plus 


114896-115831 


405727 


9838331 


Minus 


78865-79664 




405760 


6066938 


Minus 


37424-38045 




405779 


7280331 


Minus 


33048-33856 




405944 


7883702 


Minus 


5143-5684 


65 


406002 


8247797 


Minus 


154007-154579 


406016 


8272661 


Plus 


41341-41940 




406097 


7107918 


Minus 


36698-37269 




406104 


9124028 


Plus 


35309-35977 




406156 


7144867 


Ptus 


379-597 


70 


406207 


5923650 


Minus 


162607-162800 


406300 


6479046 


Minus 


19234-19401 




406308 


9211532 


Plus 


358408-358651 




406314 


9211609 


Minus 


12899-13011,18022-18136 




406317 


9211652 


Rus 


108018-108410 


75 


406432 


9256504 


Plus 


3804-3930,4026-4120.4929-5109 


406490 


7711309 


Minus 


80295-80480 




406584 


3983530 


Minus 


39894497 
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TABLE 9A: Genes predictive of no bladder cancer progression 

Ptey: Unique Eos probesel Identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

Unlgenelp. Unigene number 

Unigene Title: Unigene gene title 

R1 80th percentile of Ta or T1 tumor Als from patients who did not upstage divided by the 80th percentile of Ta or T1 tumor Als from patients who did upstage 
5? median of Ta or T1 tumor Als from patients who did not upstage divided by the median of Ta or T1 tumor Als from patients who upstaged 



Pkey 


ExAccn 


UnigertelD 


Unigene Title 


R1 


R2 


408000 


111690 


Hs.198589 


bullous pemphigoid antigen 1 (23G/240kD) 


4.64 


5.88 


412129 


M21984 


Hs.73454 


troponin T3, skeletal, fast 


4.54 


5.10 


459290 


NMJ)01546 


Hs.34853 


Inhibitor of ONA binding 4, dominant neg 


4.37 


1.63 


400844 






NNL003105*:Horno sapiens sortBrwelated 


3.69 


5.90 


419555 


AA244416 




gb:nc07d11.s1 NCLCGAPJ'rl Homo sapiens 


3.61 


Z03 


414522 


AW518944 


Hs.76325 


step 11 splicing factor SLU7 


3.60 


1.00 


440509 


BE410132 


Hs.134202 


ESTs, Weakly similar to T17279 hypotheti 


3.58 


1.04 


445182 


AW189787 




ESTs 


3.57 


Z70 


407151 


H25836 


Hs.301527 


ESTs, Moderately similar to unknown [H.s 


3.56 


1.48 


421314 


BE440002 


Hs.180324 


Homo sapiens, clone 1MAGE:4183312. mRNA, 


3.28 


125 


429863 


M68874 


Hs.211587 


phosphofipase A2, group JVA (cytosofc, 


3.25 


Z50 


430702 


U56979 


Hs.278568 


H factor 1 (complement) 


3.20 


Z70 


412420 


AL03566B 


HsJ3B53 


bone morphogenetic protein 2 


3.20 


Z30 


420729 


AW964B97 


Hs.290825 


ESTs 


3.20 


1.53 


433376 


A1249361 


Hs.74122 


caspase 4, apoptosis-related cysteine pr 


3.00 


4.10 


420028 


AB014680 


Hs.8786 


carbohydrate (M-a»rylg!u(»sarriine-6-0) s 


2.94 


Z78 


407881 


AW072003 


Hs.40968 


heparan sulfate (glucosamine) 3-O-sutfol 


2.93 


1.43 


426283 


NM.003937 


Hs.169139 


kynureninase (L-kynurenine hydrolase) 


2.93 


1.33 


428030 


A1915228 


Hs. 11493 


Homo sapiens cDNA FU 13536 fts. done PL 


2.92 


Z47 


419713 


AW968058 


Hs.92381 


nudix (nucleoside diphosphate linked moi 


2.89 


3.33 


414407 


AA147026 


Hs.76704 


ESTs 


2.87 


2.87 


450779 


AW204145 


Hs.155044 


ESTs 


Z78 


1.86 


411243 


AB039888 


Hs.69319 


CA11 


Z73 


too 


417878 




Hs.82845 


Homo sapiens cDHA: FU21930 fo, clone H 


£68 


Z32 


441619 


NM.014O56 


Hs.7917 


DKFZP564K247 protein 


2.67 


Z98 


446619 


AU076643 


Hs.313 


secreted phosphoproteln 1 (osleopontin, 


Z66 


1.00 


440006 


AK000517 


Hs.6844 


hypothetical protein FU 20510 


2.64 


1.77 


426252 


BE176980 


Hs.28917 


ESTs 


2.63 


7.30 


424008 


R02740 


Hs.137555 


putative chemokine receptor; GTP-btnding 


2.60 


Z53 


429429 


AA629725 


Hs.334437 


hypothetical protein MGC4248 


2.59 


3.34 


427450 


AB014526 


Hs.178121 


KIAA0626 gene product 


2.57 


Z28 


420180 


AI004035 


Hs.25191 


ESTs 


Z56 


1.68 


434061 


AW024973 


Hs.283675 


NPD009 protein 


2.54 


2.10 


422070 


AF1497B5 


Hs.111126 


pituitary tumor-transforming 1 interacti 


2.54 


3.25 


419355 


AA428520 


Hs.90061 


progesterone binding proton 


2.53 


3.63 


446215 


AW821329 


Hs.14368 


SH3 domain binding glutamic acid-rich pr 


2.52 


4.38 


432442 


A1672516 


Hs.178485 


' ESTs, Weakly similar to S65657 alpha-1C- 


2.50 


4.60 


447887 


AA114050 


Hs.19949 


caspase 8, apoptosis-related cysteine pr 


2.49 


2.23 


401155 






Target Exon 


Z4B 


Z30 


404530 






Target Exon 


Z48 


1.00 


446006 


KM.004403 


Hs.13530 


deafness, autosomal dominant 5 


Z48 


Z35 


436476 


AA326108 


Hs.33829 


bHLH protein DEC2 


2.47 


Z88 


446535 


AF257175 


Hs.15250 


peroxisomal D3,D2-enoy}-CoA isomerase 


2.46 


Z19 


408636 


BE294925 


Hs.46680 


CGU2 protein 


2.45 


1.60 


420962 


NMJ305904 


Hs.100602 


MAD (mothers against dec apentaplegjc, Dr 


Z44 


3.75 


427008 


Z45258 


Hs.286013 


short coiled-coil protein 


2.42 


3.40 


459711 


BE386801 


Hs.21858 


trinucleotide repeat containing 3 


Z40 


Z78 


407910 


AA550274 


Hs.41296 


tibronectin leucine rich transmembrane p 


2.40 


1,00 


410337 


M83822 


Hs.62354 


cell division cycle 4-fike 


Z39 


3.88 


435029 


AF167706 


Hs.19280 


cysteine-nch motor neuron 1 


Z39 


3.23 


437181 


AI306615 


Hs.125343 


ESTs, Weakly similar to KWA0758 protein 


Z39 


1.00 


410968 


AA199907 


Hs.67397 


homeo box A1 


Z38 


1.33 


422511 


AU076442 


Hs.1 17938 


collagen, type XVI), alpha 1 


2.38 


6.40 


450775 


AA902384 


Hs.73853 


bone morphogenetic protein 2 


Z38 


Z71 


442433 


BE243044 


Hs.8309 


K1AA0747 protein 


Z37 


3.68 


454000 


AA040620 


Hs.5672 


hypothetical protein AF140225 


2.38 


1.14 


447701 


BE619526 


Hs.255527 


hypothetical protein MGC14128 


2.36 


Z02 


427985 


AI770170 


Hs.29643 


Homo sapiens cONA FU13103 lis, clone NT 


2.38 


Z18 


442257 


AW503831 


Hs.323370 


Human EST clone 25267 mariner transposon 


2.35 


3,53 


454070 


N79110 


Hs.21276 


collagen, type IV, alpha 3 (Goodpasture 


2.35 


3.60 


418452 


BE379749 


Hs.85201 


C-type (calcium dependent, carbohydraie- 


Z35 


Z48 


421218 


rW_00O499 


Hs.72912 


cytochrome P450, subfamily 1 (aromatic c 


2.35 


1.00 


407793 


AW080879 


Hs.236572 


gbacc3Bg04jc1 NCI_GGAP_Co20 Homo sapiens 


Z35 


1.21 


442061 


AA774284 


Hs.285728 


aW-interactor 12 (SH3<ontaining protei 


2.34 


3.03 


402845 






ENSP(KKKW246267:K{AA0444 PROTEIN (FRAGME 


Z34 


1.52 


411407 


R00903 


Hs.169793 


ribosornal protein L32 


Z34 


0.77 


418506 


AA084248 


Hs.85339 


G protein-coupled receptor 39 


Z34 


0.72 
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10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



424637 


NM.O15057 
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201 



1.92 


2.15 


1.92 


2.10 


1.92 


4.43 


1.92 


4.00 


1.92 


Z13 


1.92 


Z58 


1.92 


178 


1.91 


2.32 


1.91 


1.88 


1.91 


1.61 


1.90 


3.35 


1.90 


3.55 


1.80 


1.46 


1.90 


3.65 


1.89 


1.73 


1.89 


279 


1.89 


3.84 


1.89 


1.00 


1.89 


2.51 


1.88 


2.35 


1.88 


1.00 


1.87 


2.99 


1.87 


1.13 


1.87 


3.00 


1.86 


£93 


1.86 


1.90 


1.86 


2.80 


1.66 


1.32 


1.85 


1.44 


1.85 


2.64 


1.85 


1.81 


1.85 


2. SO 


1.85 


110 


1.84 


3.70 


1.84 


2.58 


1.84 


1.52 


1.84 


3.08 


1.84 


1.20 


'1.84 


2.35 


1.84 


2.18 


1.84 


218 


1.84 


1.98 


1.84 


1.00 


1.83 


3.93 


1.83 


3.10 


1.83 


3.03 


1.83 


1.78 


1.82 


1.53 


1.82 


1.00 


1.61 


1.57 


1.81 


205 


1.81 


1.98 


1.80 


1.91 


1.80 


4.88 


1.80 


261 


1.80 


233 


1.80 


3,17 


1.80 


1.00 


1.79 


266 


1.79 


230 


1.79 


200 


1.79 


3.92 


1.79 


1.24 


1.79 


239 


1.78 


202 


1.78 


0.96 


178 


260 


1.78 


2.47 


1.78 


217 


1.78 


313 


1.78 


1.55 


1.78 


236 


1.78 


220 


1.78 


203 


1.77 


229 


1.77 


293 


1.77 
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400773 






NM_003105*:Hon» sapiens soriiftwelated 


1.77 


1.76 


400175 






Eos Control 


1.77 


2.04 


421501 


M29971 


Hb.1384 


O-fctnethylguanine-DNA rralhyltransferase 


1.77 


2.32 


451234 


A1914901 


Hs.24052 


ESTs, Weakly similar to 138022 hypoiheB 


1.77 


243 


423332 


AI091466 


Hs.1 27241 


sorting nexin 7 


1.76 


1.82 


423960 


AA164516 


Hs.136309 


SH3-conlalning protein SH3GLB1 


1.76 


2.00 


450489 


AJ697990 


Hs.346002 


ESTs 


1.75 


3.15 
237 


457265 


AB023212 


Hs.225967 


K1AA0S95 protein 


1.76 


413076 


U10564 


Hs.75188 


weel (S. pombe) homotog 


1.75 


118 


421948 


L42583 


Hs.334309 


keratin 6A 


1.75 


1.00 


453578 


R06875 


Hs.81810 


ESTs 


1.75 


3.10 


412430 


AW675054 


Hs.73875 


fumarylacetoacelale hydrolase (tumarylac 


1.75 


214 


439396 


BE562958 


Hs.74346 


hypothetical protein MGC14353 


1.75 


1.78 


431448 


AL137517 


Hs.306201 


hypothetical protein DKFZp56401278 


1.75 


236 


44953B 


AI559444 


Hs.104679 


ESTs 


1.75 


3.07 


453146 


AI338952 


Hs.32194 


ESTs 


1.74 


282 


426122 


NM.006925 


Hs.166975 


splicing factor, arginlne/serine-rich 5 


1.74 


288 


408989 
441715 


AW361666 


Hs.49500 


K1AA0746 protein 


1.74 


2.07 


AI92S453 


Hs.342655 


Homo sapiens cONA FUl3289fis, done OV 


1.74 


2.06 


412718 


X79204 


Hs.74520 


spinocerebellar ataxia 1 (olivoponlocere 


1.74 


246 


450798 


AW167780 


Hs.50438 


ESTs 


1.74 


202 


445537 


AJ245671 


Hs.12844 


EGF-like-oomain, multiple 6 


1.73 


258 


400190 






Eos Control 


1.73 


240 


416309 


R84694 


Hs.79194 


cAMP responsive element binding protein 


1.73 


1.48 


410219 


T98226 


Hs.171952 


occtudin 


1.73 


275 


419814 


AW402478 


Hs.93213 


Ba2-anJagonist/killer 1 


1.73 


270 


448625 


AW970786 


Hs.178470 


hypothetical protein FU22662 


1.73 


207 


422387 


AA309996 


Hs.148656 


ESTs, WeaWy similar to T12453 hypotheti 


1.73 


202 


417386 


AL037228 


Hs.82043 


D1 23 gene product 


1.73 


244 


405812 






Target Exon 


1.72 


294 


436270 


C03769 


Hs.339669 


Homo sapiens, clone I MAGE: 3947554. mRNA, 


1.72 


285 


409855 


AW502461 




gb:Ul-HF^R0p^v-W)8^Ulj1 NlH_MGC_5 


1.72 


263 


411442 


N25956 


Hs.101810 


Homo sapiens cDNA FU 14232 fls. done NT 


1.72 


1.88 


400846 






sortilirwelated receptor, L(DLRdass) 


1.72 


1.63 


401660 






Target Exon 


1.72 


263 


402190 






C19000835*:giI10946730lreilNP_06736Z1| 


1.72 


3.33 


439191 


AA281177 


Hs.41182 


Homo sapiens DC47 mRNA, complete cds 


1.71 


217 


410444 


W73484 


Hs.132554 


gb:zd54e04.s1 SoaresJetaLhearLNbHH19W 


1.71 


270 


430393 


BE185030 


Hs.241305 


estrogen-responsive B box protein 


1.71 


1.33 


445056 


A1343931 


Hs.149383 


ESTs 


1.71 


232 


411299 


BE409857 


Hs.69499 


hypothetical protein 


1.71 


2.92 


408246 


N55669 


Ha.333823 


rriHDchondriaJ rfbosomaJ protein L13 


1.71 


200 


454054 


AI336329 


Hs.301519 


Homo sapiens cDNA FU12536 lis, done NT 


1.71 


1.93 


417381 


AF164142 


Hs.82042 


solute carrier family 23 (nudeobase Ira 


1.70 


3.70 


427820 


BE222494 


Hs.180919 


inhibitor of DMA binding 2, dominant neg 


1.70 


1.60 


400750 






Target Exon 


1.70 


282 


455842 


BE145837 




gb:MRO-HT020B.101299-202-c07 HT0208 Homo 


1.70 


217 


429956 


BE081342 


Hs.283037 


HSPC039 protein 


1.70 


1.18 


418444 


A1902899 


H3.85155 


butyrate response factor 1 (EGF-response 


1.70 


247 


437450 


AL390154 


Hs.26954 


Homo sapiens mRNA; cDNA DKFZp762G123 (fr 


1.70 


3.03 


415738 


BE539367 


Hs.295953 


ESTs, Weakly similar to AF220049 1 uncha 


1.70 


234 


405245 






Target Exon 


1.70 


1.99 


408483 


AA464836 


Hs.291079 


ESTs, WeaWy similar to T27173 hypotheti 


1.70 


2.05 


413611 


BE153275 




gb:PM(W0335-18()40a00^e11 HT0335 Homo 


1.70 


205 


410190 


AW072328 


Hs.59728 


Homo sapiens mRNA; cDNA DKFZp566C0546 (f 


1.69 


2.20 


434608 


AA805443 


Hs.179909 


hypothetical protein FU22995 


1.69 


236 


432170 


T56887 


Hs.18282 


KIAA1 134 protein 


1.69 


1.83 


448182 


AF244137 


Hs.20597 


host cell factor homolog 


1.69 


211 


436293 


A1601188 


Hs.120910 


ESTs 


1.69 


2.37 


448524 


AB03234B 


Hs.21356 


hypothetical protein DKFZp762K2015 


1.68 


248 


404231 






Target Exon 


1.68 


250 


453906 


AW444952 


Hs.257054 


ESTs 


1.68 


2.45 


437967 


BE277414 


Hs.5947 


meJ transforming oncogene (derived from 


1.68 


1.00 


426125 


X87241 


Hs.166994 


FAT tumor suppressor (Drosophila) homolo 


1.68 


asi 


448813 


AF169802 


Hs.22142 


cytochrome b5 reductase b5R2 


1.68 


1.69 


429162 


AK00125O 


Hs.197642 


hypothetical protein FU 10388 


1.68 


3.13 


425556 


H27225 


Hs.9444 


hypothetical protein FU13114 


1.67 


202 


405630 






Target Exon 


1.67 


3.00 


421405 


AA251944 


Hs.104058 


CGI-29 protein 


1.67 


3.25 


422640 


M37984 


Hs.118845 


troponin C, stow 


1.67 


1.23 


450857 


AA629075 


Hs.190090 


ESTs 


1.67 


2.48 


451668 


Z43948 


Hs.326444 


cartilage addle protein 1 


1.66 


255 


433821 


AW182416 




ESTs 


1.66 


265 


405595 






NM_000721*:Homo sapiens caMum channel. 


1.66 


223 


433892 


AI929357 


Hs.323966 


Homo sapiens done H53 unknown mRNA 


1.66 


1.97 


443558 


AA376798 


Hs.286122 


MOSQ24 protein 


1.66 


2.00 


412141 


AI183838 


Hs.48938 


hypothetical protebi FU21802 


1.66 


265 
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WO 03/003906 



PCT7US02/21338 



10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



424685 


W21223 


Hs.151734 


nuclear transport factor 2 {placental pr 


1.66 


2,88 


400845 






NMJ)03105*:Homo sapiens sorOIirweiated 


1.66 


1.61 


447816 


NM_007233 


Hs.274329 


TP53 target gene 1 


1.66 


2.63 


404438 






Target Exon 


1.66 


2.34 


451543 


AA397651 


Hs.301959 


proline synthetase co-transcribed (bade 


1.65 


2.08 


433233 


AB040927 


Hs.301804 


KIAA1494 protein 


1.65 


3.13 


420938 
435438 


AL049698 


Hs.100469 


myelotd/tymphotd or mixed-lineage teukem 


1.65 


1.37 


K84421 


Hs.4890 


ubiquitin-conjugatlng enzyme E2E 3 (homo 


1.65 


2.35 


431130 


NMJW61Q3 


Hs.2719 


HE4; epidldymis-sperific, whey-acidic pr 


1.65 


1.00 


433235 


AB040929 


Hs.35089 


contact)/) 3 (plasmacytoma associated) 


1.65 


1.44 


439632 


AW410714 


Hs.334437 


hypothetical protein MGC4248 


1.65 


235 


409324 


W76202 


Hs.343812 


lipotc acid synthetase 


1.65 


2.00 


452207 


NMJJ14517 


Hs.28423 


upstream binding protein 1 (LBP-1a) 


1.65 


2.33 


423630 


AB011132 


Hs.129952 


KIAA0560 gene product 


1.65 


213 


443358 


H65417 


Hs.17757 


pleckstrin homology domain-containing, f 


1.65 


1.63 


427417 


AA341061 


Hs.177861 


CGM 10 protein 


1.64 


1.28 


450353 


AI244661 


Hs.103296 


ESTs, Weakly similar to S65657 aJpha-IC- 


1.64 


1.60 


445677 


H96577 


Hs.6838 


ras homoiog gene family, member E 


1.64 


1.91 


447503 


AA115496 


Hs.336898 


Homo sapiens. Similar to RIKENcDNA 1810 


1.64 


204 


431234 


AL389985 


Hs.301637 


zinc finger protein 258 


1.64 


1.53 


41B032 


AW964695 


Hs.9436 


Homo sapiens, done MGC: 15763, mRNA, com 


1.64 


205 


407796 


AA195509 


Hs.39733 


postsynaptic protein CRIPT 


1.64 


230 


446298 


AF187813 


Hs.14637 


kidney- and liver-specific gene 


1.64 


2.05 


439578 


AW263124 


Hs.315111 


nuclear receptor co-repressor/HDAC3 comp 


1.64 


2.26 


429113 


D28235 


Hs.196384 


prostaglandin-endoperoxide synthase 2 (p 


1.64 


210 


433646 


AA603319 


Hs.155195 


ESTs 


1.64 


205 


4077B3 


AW996872 


Hs.172028 


a disintegrin and metafloproteinase doma 


1.64 


1.00 


419982 


AA252544 


Hs.55610 


solute carrier family 30 (zinc transport 


164 


216 


401603 






NM_022041*:Homo sapiens giant axonai neu 


1.64 


273 


431604 


AF175265 


Hs.264190 


vacuolar protein sorting 35 (yeast homoi 


1.64 


275 


400788 






C6000994*:gi|10435784|dbi|BAB14668Ll| (A 


1.63 


204 


416221 


BE513171 


Hs.79086 


mitochondrial ribosomal protein L3 


1.63 


264 


422491 


AA33B548 


Hs.117546 


neuronatin 


1.63 


0.95 


424737 


BE301B83 


Hs.152707 


glioblastoma amplified sequence 


1.63 


3.45 


416078 


AL034349 


Hs.79005 


protein tyrosine phosphatase, receptor t 


1.63 


1.39 


403988 






C5001831:gi|11056014irefJNP_067651.1| ac 


1.62 


211 


411486 


N85785 


Hs.181165 


eukaryotic translation elongation factor 


1.62 


263 


407874 


AI766311 


Hs.289047 


Homo sapiens cONA FU14059 fis. done HE 


1.62 


219 


446700 


AW206257 


Hs.156326 


Human DNA sequence from clone RP1 1-145L2 


1.62 


3.03 


438184 


AA779897 


Hs.122125 


ESTs 


1.62 


279 


405502 






C7000609*:gi|628012JplrlIA53933 myosin I 


1.62 


255 


447050 


NM.016314 


Hs.17200 


STAM-iike protein containing SH3 and ITA 


1.62 


24B 


457961 


AA772119 


Hs.270721 


ESTs, WeaWy similar to 138022 hypolheti 


1.62 


230 


436774 


AW975810 


Hs.159054 


hypothetical protein FU 13224 


1.62 


217 


414893 


AA215295 


Hs.77578 


ubiquitin specific protease 9, X chromes 


1.62 


203 


458660 


AI299739 


Hs.99601 


hypothetical prolan FU 12553 


1.62 


225 


405806 






Target Exon 


1.62 


215 


421205 


AL137540 


Hs.102541 


netrto4 


1.62 


1.00 


424012 


AW368377 


Hs.137569 


tumor protein 63 kOa with strong homoiog 


1.62 


1.74 


427016 


AA397525 


Hs.191579 


ESTs 


1.61 


216 


458182 


A1147996 


Hs.155833 


ESTs, Weakly similar to spliceosomal pro 


1.61 


274 


451109 


F11875 


Hs.5534 


Homo sapiens cDNA FU 12961 fis, clone NT 


1.61 


2.59 


414807 


AI738616 


Hs.77348 


hydroxyprostaglandin dehydrogenase 15-{N 


1.61 


1.78 


456508 


AA502764 


Hs.123469 


ESTs, Weakly similar to AF208855 1 BM-01 


1.61 


210 


447532 


AK000614 


Hs.18791 


hypothetical protein FU20607 


1.61 


1.75 


439944 


AA856767 


Hs.124823 


ESTs 


1.61 


241 


414692 


H06831 


Hs.164557 


ESTs, Moderately similar to ALUC.HUMAN 1 


1.60 


3.05 


433187 


R53995 


Hs.293381 


ESTs, Moderately similar to ALU7.HUMAN A 


1.60 


263 


446825 


BE266822 


Hs.344097 


filamln A, alpha (actln-blndlng pratein- 


1.60 


243 


441166 


AA921738 


Hs.132473 


ESTs 


1.60 


2.69 


425571 


AJ007292 


Hs.158306 


ephruvA2 


1.60 


1.49 


406836 


AW514501 


Hs.156110 


irnrrrunogtabulrn kappa constant 


1.60 


1.08 


432374 


W68B15 


Hs.301885 


Homo sapiens cONA FU11346 (Is, done PL 


1.60 


1.47 


449268 


AW369278 


Hs^3412 


hypothetical protein FU20160 


1.50 


2.89 


400772 






NM_003105*:Homo sapiens sortifirHelated 


1.60 


257 


445733 


BE295568 


Hs.13225 


UDP-Gal:betaGlcNAc beta 1,4- galactosylt 


1.60 


2.03 


428172 


U09367 


Hs.182828 


zinc finger protein 136 (clone pHZ-20) 


1.60 


268 


421687 


AW161450 


Hs.109201 


CGI-86 protein 


1.59 


1.39 


418127 


BE243982 


Hs.83532 


membrane co factor protein (C046, trophob 


1.59 


1.67 


400297 


AI127076 


Hs.306201 


hypothetical protein DKFZp56401278 


1.59 


219 


434938 


AW500718 


Hs.8115 


Homo sapiens, clone MGC:16169, mRNA, com 


1.59 


226 


417924 


AU077231 


Hs.82932 


eyefin D1 (PRAD1: parathyroid aoenomatos 


1.59 


1.76 


418067 


AI127958 


Hs.83393 


cyslatin E/M 


1.59 


1.26 


427127 


AW802282 


Hs.22265 


pyruvate dehydrogenase phosphatase 


1.59 


225 


451938 


AI354355 


Hs.16697 


down-regulator of transcription 1, TBP-b 


1.59 


210 


407325 


AA291180 


Hs.328476 


ESTs, Weakly similar to alternatively sp 


1.58 


243 


410796 


Z44547 


Hs.3731 


ESTs, Moderately similar to 138022 hypot 


1.58 


1.26 
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417343 


AA197132 


Hs.231581 


myosin, heavy polypeptide 1, skeletal mu 


1.58 


184 


416543 


U62531 


Hs.79410 


solute carrier famfly 4, anion exchanger 


1.58 


1.26 


400847 


AW605927 


Hs.5306 


NM_003105*:Homo sapiens sortifin-related 


1.58 


1.48 


436760 


hypolhefical protein DKFZp586F1 122 shrill 


1.57 


1.57 


433427 


AI816449 


Hs.171889 


chdmephosprralransferase 1 


1.57 


1.64 


451988 


BE246996 


Hs.318401 


hypothetical protein DKFZp564D137B 


1.57 


1.83 


428901 


AI929568 


Hs.146668 


KIAA1253 protein 


1.57 


Z23 


426028 


NM.001110 


Hs.172028 


a disintegrin and melaDoproteinase doma 


1.57 


3.07 


444604 


AW327695 


Hs.11441 


chromosome 1 open reading frame 8 


1.57 


1.86 


439686 
426996 


W40445 


Hs.235857 


ESTs, Weakly similar to I38022 hypothec" 


1.57 


3.07 


AW968934 


Hs.173106 


Homo sapiens cDNA: aJ21897 fis, clone H 


1.57 


2.01 


447343 


AA256641 


Hs.236894 


ESTs, Highly srmilar to S02392 a!pha-2*n 


1.57 


283 


418942 


Al 556004 


Hs.141269 


Homo sapiens cDMA FU21550 fis. done C 


1.57 


1.21 


418555 


AI417215 


Hs.87159 


hypothetical protein RJ12577 


1.56 


3.08 


402368 






NM-021155":Homo sapiens CD209 antigen (C 


1.56 


2.05 


419749 


X73608 


Hs.93029 


sparc/osteonectin, cwcv and kazaHike d 


1.56 


2.08 


404977 






Insulin-like growth factor 2 (somatomedi 


1.56 


5.50 


441872 


BE567100 


Hs.154938 


hypothetical protein MDS025 


1.56 


230 


415503 


U36601 


HsJ8473 


N-deacetylasa/N-sultotransferase (he para 


1.56 


2.56 


451743 


AW074266 


Hs.23071 


ESTs 


1.56 


1.85 


423184 


NMJXJ4428 


Hs.1624 


ephrin-A1 


156 


1.41 


408041 


AW1 38782 


Hs.243607 


ESTs 


1.56 


221 


416777 


AF146760 


Hs.79844 


DKFZP564M1416 protein 


1.56 


2.00 


428013 


AF151020 


Hs.181444 


hypothetical protein 


1.56 


1.53 


410072 


BE384447 


Hs.16034 


hypothetical protein MGC13186 


1.55 


1.52 


411495 


AP00C693 


Hs.70359 


WAA01 38 protein 


1.55 


283 


408162 


AA993833 


Hs.1 18527 


ESTs 


1.55 


270 


413350 


U02556 


Hs75307 


t-complex-associated-testis-ex pressed 1* 


1.55 


1.99 


422010 


AA302049 


Hs.31181 


Homo sapiens cDNA FU23230 fis, clone C 


1.55 


1.60 


425229 


AU076961 


Hs.155212 


methylmalonyl Coenzyme A mutase 


1.55 


257 


425184 


BE278288 


Hs.155048 


Lutheran Wood group (Auberger b antigen 


1.55 


1.45 


419011 


H56244 


Hs.89552 


glutathione S- transferase A2 


1.55 


277 


417538 


AW050865 


Hs.275711 


hypothetical prolan MGC2452 


1.55 


276 


409806 


AW500960 




gb:UmF-BP0p.aiy-b^1^Ulj1 N!H_MGC_5 


1.55 


245 


402737 






Target Exon 


1.54 


258 


419B25 


AI754011 


Hs.7326 


ESTs 


1.54 


1.00 


410001 


AB041036 


Hs.57771 


kallikrein 11 


1.54 


0.62 


407813 


AL120247 


Hs.40109 


KIAA0872 protein 


1.54 


233 


415906 


AJ751357 


Hs.288741 


Homo sapiens cDNA: FU22256 lis, dona H 


1.54 


277 


427886 


AA417083 


Hs.104789 


ESTs 


1.54 


260 


437018 


AA889078 


Hs.187033 


ESTs 


1.54 


248 


415049 


N67334 


Hs.50158 


ESTs 


1.54 


257 


422315 


U16296 


Hs.1 15176 


T-celi rymchoma invasion and metastasis 


1.54 


257 


413715 
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1.37 


225 


443856 


AK000574 


Hs.9908 


nitrogen fixation cluster-tike 


1.37 


1.28 


453489 


AA300067 


Hs.33032 


hypothetical protein DKFZp434N185 


1.37 


2.15 


424870 


W61215 


Hs.1 16851 


epithelial V-tike antigen 1 


1.37 


1.66 


428995 


AW004975 


Hs.194716 


MAO (mothers against decapentaplegic, Dr 


1.37 


1.33 


441551 


AA318224 


Hs.296141 


ESTs 


1.37 


295 


45052B 


NMJ)14072 


Hs.25063 


PRO0461 protein 


1.37 


1.19 


4Z7605 


NO00997 


Hs.337445 


ribosomai protein L37 


1.37 


1.31 


459237 


AA031675 


Hs.31917 


Homo sapiens, done M6G9658, mRNA, comp 


1.37 


250 


413691 


AB023173 


Hs.75478 


ATPase, aassVl,type11B 


1.37 


1.31 


404906 






NKt025213:Homo sapiens spectrin, beta, n 


1.36 


3.08 


436246 


AW450963 


Hs.1 19991 


ESTs 


1.36 


1.00 


441478 


AA350018 


Hs.301342 


hypothetical protein MGC4342 


1.36 


1.43 


419715 


AF070523 


Hs.92384 


vitamin A responsive; cytosketeton relat 


1.35 


1.28 


426251 


M24283 


Hs.168383 


intercellular adhesion molecule 1 (CO 54) 


1.36 


216 


400129 






Eos Control 


1.36 


203 


450447 


AF212223 


Hs.25010 


hypothetical protein P15-2 


1.36 


213 


434697 


AL133033 


Hs.4084 


WAA1025 protein 


1.36 


201 


430308 


BE540865 


Hs.238990 


cyc&Kiependsnt kinase inhibitor 1B (p2 


1.36 


203 


434767 


AF153201 




C2H2 (KruppeHype) zinc finger protein 


1.36 


287 


459729 


AL037285 


Hs.289848 


EST, Weakly simflar to ALU4 JWMAN ALU SU 


1.36 


1.27 


426653 


AA530892 


Hs.171695 


dual specificity phosphatase 1 


1.35 


220 


408912 


AB011084 


Hs.48924 


K1AA0512 gene product ALEX2 


1.35 


268 


409844 


AW502336 




gb:U^F-BRO>aka^05-(MJIJl N1H_MGC_5 


1.35 


229 


402517 






Target Exon 


1.35 


210 


447042 


AB035863 


Hs.182217 


succtnate-CoA ugase, AOP-rormmg, Beta 


1.35 




405000 






Target Exon 


1.35 


232 


452065 


AK000360 


Hs.27721 


Wolf-Hirschhorn syndrome candidate 1-fik 


1.35 


236 


404666 






C9000748:gi)8324209jgb|AAB34384^ (S775 


1.35 


255 


451081 


A1078645 


Hs.431 


murine leukemia viral (bmM) oncogene h 


1.35 


1.70 


427979 


BE379776 


Hs,181309 


proteasome (prosome, macropaln) subunit, 


1.35 


223 


435825 


R16702 


Hs.91147 


ESTs 


1.35 


239 


426469 


BE297886 


Hs.293970 


me thytmal on als-semiatdehyde dehydrogenas 


1.35 


'1.40 


447002 


BE242866 


Hs.16933 


HepA-related protein 


1.34 


288 


410946 


AW811502 




gb:QV2-ST0145-06129941S-M4 ST0145 Homo 


1.34 


202 


454383 


AW500332 


Hs.11114 


hypothetical protein djl 181N3.1 


1.34 


213 


440512 


AAB87845 


Hs.19673 


suppressor of S. cerevisiae gcr2 


1.34 


205 


409885 


AW502208 




gb:UWF-8R0>^u-<M)9-0.UJ.r1 N1HJWGC.5 


1.34 


263 


447390 


X95384 


Hs.18426 


translaOonal inhibitor protein p14.5 


1.34 


1.00 
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450293 


MQC7CA 


nS.171ud 


Hypothetical protein FU 00026 


1.34 


2.45 


445831 


MM nngflcc 
NI^_UUOUDO 


iit> i?oci 


lanC (bacterial lantlbbtic synthetase c 


1.34 


1.60 


4100 IU 


AlAfOA COO'S 


H5.Z2J034 


hypothetical protein MGC2742 


1.34 


1.39 


V* 1 340 


AW298716 


Hs.1 20775 


ESTs 


1.34 


£30 




H49944 


Hs.1 4231 


selenoprotein W, 1 


1.34 


1.17 


H IO400 


QCC17Q7T 


Hs.48617 


Homo sapiens cDMA FU12540 fis, clone NT 


1.34 


2.22 




AlOCXCQC 


u. 4 coin 

Hs.1 58321 


beaded filament structural protein 2, pb 


1.34 


2.50 




1 I101O1 
UZ0I3I 




gb:Human HMGI-C chimeric transcript mRNA 


1.34 


£13 


441331 


AI216764 


Hs.149971 


ESTs, Moderately sfrniter to ALLfB .HUMAN ! 


1.34 


205 


411789 


AF245505 


Hs.72157 


Adfican 


1.34 


1.27 


A one AH 


Alii APlACAf 

NNL0005O5 


Hs.1321 


coagulation factor XII (Hageman factor) 


1.33 


1.25 


413892 


A1878921 


Hs.75607 


myristoylated alajiine-rich protein Wnas 


1.33 


1.41 


439750 


AL359053 


Hs.57664 


Homo sapiens mRNA full length insert cDN 


1.33 


1.99 


414861 


AL1 19396 


Hs. 77508 


glutamate dehydrogenase 1 


1.33 


1.66 


421687 


AL035306 


Hs.106823 


hypothetical protein MGC14797 


1.33 


2.18 


M C\QAC 

41Utt4b 


AW807057 




gb:MR4-ST0062-031 199-018-603 ST0062 Homo 


1.33 


£07 


44J3J/ 


R66571 


Hs.24601 


ESTs 


1.33 


2.02 


4o<3bU 


BE045243 


Hs.274416 


Target CAT 


1.33 


1.12 


44 J! IS 


AA312264 


Hs.7980 


hypothetical protein MGC12966 


1.33 


£68 


438464 


AA669735 


Hs.324743 


protein phosphatase 4 regulatory subunit 


1.33 


1.99 








ENSP00000198192*:BA438F9.1 (novel protei 


1.33 


1.10 


4U044J 






Target Exon 


1.33 


2.11 


4W7o4 


BE008180 


Hs.282846 


Homo sapiens cDNA FU14353 fis, clone V7 


f .33 


£88 


424924 


AL039103 


Hs.1 53834 


pumilio (Drosophila) homoiog 1 


1.33 


1.24 


453555 


N23574 


Hs.1 23649 


ESTs, Moderately similar to ALU7_HUMAH A 
C7002J 9r;gi]5053028/gbiAAD3881 1. 1|AF 1 55 


1.33 


£23 


404343 






1.33 


1.04 


412383 


AW947577 




gb:RCO-MT0004-1 40300-031 -t 09 MT0004 Homo 


1.33 


2.06 


404250 






Target Exon 


1.33 


£53 


413899 


AF083892 


Hs.75608 


tight junction protein 2 (zona ocdudens 


1.33 


2.61 


422716 


AI702835 


Hs.1 24475 


ESTs, Weakly similar to YEF4JTEAST HYPOT 


1.33 


£30 


A A QOO 

440002 


AI351979 


Hs. 152717 


hypothetical protein FU 13725 


1.33 


1.08 


409540 


AW409569 




gb:fh0le09j(1 NIH_MGCJ7 Homo sapiens cD 


1.33 


£18 


431186 


till f\4 'VMrt 

NNL012249 


Hs.250697 


ras4Eke protein 


1.32 


1.39 


402754 






NMJ)22469*:Homo sapiens hypothetical pro 


1.32 


1.16 


420798 


W93774 


Hs.99935 


Keratin 10 (epiaermolytic nyperxeralosis 


1.32 


£02 


459710 


AI701596 


Hs.121592 


ESTa 


1.32 


£70 


435192 


AK000739 


Hs.4835 


eukaryotic translation initiation factor 


1.32 


£22 


401383 






Target Exon 


1.32 


£18 


453394 


AW960474 


Hs.40289 


ESTs 


1.32 


£20 


421820 


AW662990 


Hs.294133 


heme-blndrng protein 


1.32 


1.24 


444047 


AI097452 


Hs.1 35095 


ESTs 


1.32 


£95 


440860 


R10482 


Hs.1 32876 


ESTs 


1.32 


£83 


425808 


AA364109 


Hs.177990 


ESTs 


1.32 


£11 


456558 


BE410992 


Hs.258730 


heme-regulated inilialion factor 2-alpha 


1.32 


£05 


447015 


AB033029 


Hs.16953 


KIAA1203 protein 


1.32 


1.30 


414015 


AA340987 


Hs.75693 


prafyicarboxypeptidase (angiotenshvase C 


1.32 


1.39 


414843 


BE386038 


Hs.77492 


heterogeneous nucfear ribonucleoprotein 


1.32 


1.28 


424058 


AL121516 


Hs.138617 


thyroid hormone receptor interactor 12 


1.32 


£01 


401196 






Target Exon 


1.32 


£13 


450147 


AW373713 


Hs. 146324 


CGM45 proton 


1.32 


1.32 


422699 


BE410590 


Hs.1 19257 


emsl sequence (mammary tumor and squamou 


1.32 


1.33 


405172 






Target Exon 


1.32 


£11 


434087 


AF1 16675 


Hs.334476 


hypothetical protein PR01942 


1.32 


£30 


416720 


H05435 


Hs.11110 


hypotheticat protein MGC2508 


1.32 


£18 


426621 


NM_001329 


Hs.171391 


C-termina) binding protein 2 


1.32 


1.53 


442685 


AB033017 


Hs.8594 


KIAA1191 protein 


1.32 


1.43 


443879 


Z28462 


Hs.9927 


Homo sapiens mRNA; cDNA DKF7p564D1 56 (fr 


1,31 


£24 


405180 


NMJrQ2649 




phospholnositide-3-kinase, catalytic, ga 


1.31 


1.36 


417365 


050683 


Hs.82028 


transforming growth factor, beta recepto 


1.31 


0.98 


402087 






Target Exon 


1.31 


1.31 


429323 


NM_001649 


Hs.2391 


apical protein, Xenopus laevts-like 


1.31 


£05 


409935 


AW511413 


Hs. 278025 


ESTs 


1.31 


1.20 


430235 


BE268048 


Hs.236494 


RAB10, member RAS oncogene family 


1.31 


1.31 


400172 






oos ooriuwi 




1.05 


421742 


AW970004 


Hs.107528 


androgen induced protein 


1.31 


179 


404273 






Target Exon 


1.31 


£35 


416204 


AW972270 


Hs.144054 


ESTs 


1.31 


£15 


435076 


AW298113 


Hs.92909 


SON DNA binding protein 


1.31 


£05 


452497 


AA732153 


Hs.27865 


Homo sapiens cDNA: FU21333 fis, done C 


1.30 


1.57 


404596 






Target Exon 


1.30 


£23 


419080 


AW150835 


Hs.18878 


hypothetical protein FU21620 


1.30 


1.15 


427195 


W27230 


Hs.173912 


eukaryotic translation initiation factor 


1.30 


1.34 


438129 


AA778647 




gb:af87d03.s1 Soares_lestis_NHT Homo sap 


1.30 


£55 


402138 






Target Exon 


1.30 


£09 


404029 






MM_018936*:Homo sapiens protocadherin be 


1.30 


£83 


402731 


AL042818 




E3 ubiquitin ligase SMURF1 


1.30 


3.32 


458766 


AW1B3618 


HS.5561Q 


sohrte carrier family 30 (zinc transport 


1.30 


1.56 
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434585 


AW451715 


Hs.184075 


ESTs, Weakly similar to ALU1 HUMAN ALU S 


1.30 


2.73 


417219 


AW973473 


Hs.220936 


ESTs 


1.30 


245 


428125 


AA393071 


Hs.182579 


leucine amlnopeptldase 


1.30 


2.00 


416188 


BE157260 


Hs.79070 


v-myc avian myeiocytornalosrs viral oncog 


1.30 


1.00 


444681 


AJ243937 


Hs.288316 


chromosome 6 open reading frame 9 


1.30 


0.94 


406621 


X57809 


Hs.181125 


immunoglobulin lambda locus 


1.29 


1.02 


436663 


AW410458 


Hs.5258 


chromosome 11 open reading frame2 


1.29 


1.20 


417250 


N5B241 


Hs.332115 


ESTs 


1.29 


143 


434978 


AA321238 


Hs.4310 


eukaryolfc translation initiation factor 


1.29 


1.91 


448079 


R76991 




IhyroW hormone receptor-associated prot 


1.29 


2.01 


450626 


AW190989 


Hs.1508 


Insulin-degrading enzyme 


1.29 


2.09 


456059 


BE543127 


Ks.336948 


Homo sapiens, done IMAGE:3530891, mRNA, 


1.29 


2.23 


417809 


H75797 


Hs.233550 


zinc finger protein 208 


1.29 


2.20 


454771 


AY7819939 


Hs.273629 


ESTs 


1.29 


2.10 


413895 


BE178160 




gb:R(^HT060(W6040(W)22-h10 HT0600 Homo 


1.29 


2.08 


404649 






. Targe) Exon 


1.29 


1.32 


440676 


ML004987 


Hs.112378 


UM and senescent cell antigen-like doma 


1.29 


2.08 


405891 






Target Exon 


1.29 


2.00 


418965 


AW02238 


Hs.11482 


splicing factor, argjru'ne/senne-nch 1 1 


1.29 


241 


412824 


AW958075 


Hs.11261 


small protine-nch protein 2A 


1.29 


1.27 


420037 


BE299598 


Hs.1 35569 


hypothetical protein FU14708 


1.29 


1.23 


459221 


BE246522 


Hs.306121 


leukocyte receptor cluster (IRC) encoded 


1.28 


2.48 


456651 


AW612481 


Hs.104105 


ESTs 


1.28 


235 


422984 


W28614 




chorionic somatomammo tropin hormone 1 (p 


1.28 


1.37 


459365 


BE067754 




gb:MR4-BT0358-t4rj40(H)Q6-g10 BT0358 Homo 


1.28 


1.06 


418254 


AA732511 


Hs.86650 


ESTs 


1.28 


238 


402474 






NM_004079:Homo sapiens cathepsln S (CTSS 


1.28 


1.99 


448456 


A1521830 


Hs.171050 


ESTs 


1.28 


218 


450098 


W27249 


Hs.8109 


hypolheficaJ protein FU2108O 


1.28 


1.68 


405053 






Target Exon 


1.28 


3.23 


428915 


AJ041278 


Hs.87908 


Snffreiafed CBP activator protein 


1.28 


4.25 


443721 


AW450451 


Hs.266355 


ESTs 


1.28 


1.15 


452047 


N35953 


Hs.43510 


ESTs, Weakly similar to BOX B BINDING FA 


1.28 


230 


440213 


AW246253 


Hs.7043 


sucrinate-CoA Bgase, GDP-forming, alpha 


1.28 


1.19 


452900 


AA626794 




prothymosin, alpha (gene sequence 28) 


1.28 


1.27 


418721 


MUM2731 


Hs.87773 


protein kinase, cAMP-dependent, catalyti 


1.28 


276 


458911 


AA373131 


Hs.24322 


ATPase, H transporting, lysosomal {vacuo 


1.28 


1.21 


444250 


R40815 


Hs.12396 


ESTs, Weakly similar to 2004399A chramos 


1.28 


243 


431631 


AA548906 


Hs.122244 


ESTs 


1.27 


1.51 


447966 


AA340605 


Hs.105887 


ESTs, Weakly similar to Homotog of rat Z 


1.27 


3.08 


430316 


NM_000875 


Hs.239176 


insulin-like growth factor 1 receptor 


1.27 


1.37 


416272 


AA178882 




gbzp38b09.r1 Stratagene muscle 937209 H 


1.27 


ZOO 


437456 


AL047045 


Hs.60293 


Homo sapiens done 122482 unknown mRNA 


1.27 


3.18 


456327 


H68741 


Hs.38774 


ESTs 


1.27 


2.35 


403349 


NM_0014Q6 




ephrin-B3 


1.27 


228 


428821 


H91282 


Hs-286232 


Homo sapiens cDNA: FU231 90 fts, clone L 


1.27 


2.13 


454555 


AW807095 




gb:MR4-ST0062-M0100-024-e02 ST0062 Homo 


1.27 


2.05 


406872 


A1760903 




gb.-wi09h08Lx1 NCLCGAP..CLL1 Homo sapiens 


1.27 


1.44 


401720 






NM.01 4587*:Homo sapiens SRY (sex determi 


1.27 


2.07 


40X82 






- Eos Control 


1.27 


1.26 


420183 


W92885 


Hs.143408 


ESTs 


1.27 


224 


411579 


AO005258 


Hs.70830 


U6 snRMA-associated Sm-tike protein LSm7 


1.27 


1.17 


402191 






NM_02l733*:Homo sapiens teslis-specJfic 


1.27 


2.44 


457118 


A!245525 


Hs.182469 


Homo sapiens mRNA; cONA DKFZp564K1972 (f 


1.27 


217 


408576 


NM_003542 


Hs.46423 


H4 histone famSy, member G 


1.27 


2.78 


452826 


BE245286 


Hs.301636 


peroxisomal biogenesis factor 6 


1.27 


3.15 


414909 


R80316 


Hs.1 32569 


PP2135 protein 


1.27 


1.37 


416114 


A1695549 


Hs.183868 


glucuronidase, beta 


1.26 


248 


455476 


AW948172 




gb:RCO-MT0013-280300-021-b06 MT0013 Homo 


1.26 


218 


445926 


AF054284 


Hs.334826 


spfidng factor 3b, subunit 1, 15&D 


1.26 


1.35 


432647 


AI807461 


Hs.278581 


fibroblast growth factor receptor 2 (bac 


1.26 


1.18 


405436 






Target Exon 


1.26 


Z38 


406140 






Target Exon 


1.26 


3.20 


426201 


AW182614 


Hs.1 28499 


ESTs 


1.26 


1.17 


433334 


AJ927208 


Hs.231958 


matrix metaltoproteinase 28 


1.26 


Z30 


423262 


NML005479 


Hs.126057 


frequently rearranged in advanced T-ceB 


1.26 


Z61 


422929 


AA356694 


Hs.94011 


ESTs, Weakly similar to MGB4_HUMAN MELAN 


1.28 


Z11 


445605 


AI906088 


Hs.87159 


hypothetical protein FU12577 


1.26 


3.11 


425050 


BE391854 


Hs.7970 


gb:601265394F1 NIH_MGC_44 Homo sapiens c 


1.26 


Z18 


420539 


AA282735 


Hs.44004 


AO031 protein 


1.26 


Z03 


437352 


AL353957 


Hs.284181 


hypothetical protein DKFZp434P0531 


1.25 


1.19 


456535 


AA305079 


Hs.1342 


cytochrome c oxidase subunil Vb 


1.25 


1.18 


434202 


BE382411 


Hs,3764 


guanylata kinase 1 


1.25 


1.14 


439528 


BE613180 


Hs.288368 


Homo sapiens cDNA' FU21314 fis, done C 


1.25 


Z12 


400178 






Eos Control 


1.25 


Z15 


430023 


AA158243 


Hs.227729 


FK506-Wnding protein 2 (13kD) 


1.25 


1.20 


412841 


AJ751157 


Hs.101395 


hypothetical protein MGC1 1352 


1.25 


1.39 
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425655 


BE614551 


Hs.738 


ribosomal protein L14 


1.25 


1.22 


443036 


AI656608 


Hs.281328 


ESTs, Weakly similar to T00378 KIAA0641 


1.25 


3.00 


418406 


X73501 


Hs.84905 


cy taker atin 20 


1.24 


211 


414570 


Y00285 


Hs.76473 


insulin-tike growth factor 2 receptor 


1.24 


1.14 


436957 


AA761729 


Hs.1 36705 


ESTs 


1.24 


2.53 


457216 


AA452554 


Hs.283697 


ESTs, Weakly similar to A41 796 neural re 


1.24 


Z18 


418414 


J04977 


Hs.84981 


X-ray repair complementing defective rep 


1.24 


1.35 


423217 


NM_000094 


Hs.1640 


collagen, type VII, alpha 1 (epWermolys 


1.24 


0.92 


421733 


AL1 19871 


Hs.1420 


fibroblast growth factor receptor 3 (ach 


1.24 


271 


402329 






NM.OOesOS'iHomo sapiens poBovlrus recep 


1.24 


1.13 


447525 


AF151031 


Hs.300631 


hypothetical protein 


1.24 


1.07 


445939 
421936 


BE018658 


Hs.141003 


Homo sapiens cDNA: aJ21691 fis, clone C 


1.24 


2.23 


AB040884 


Hs.109694 


WAA1451 protein 


1.24 


2.15 


433581 


AI004377 


Hs.200360 


Homo sapiens cDNA FU 13027 fis, done NT 


1.24 


215 


426717 


N90977 


Hs.49690 


Homo sapiens mRNA; cDNA DKFZp434D2328 (f 


1.24 


214 


404751 


T70445 




ribosomal protein L9 


1.24 


1.30 


411456 


AW847588 




gb:lL3-CT0213-161299438-G09 CT0213 Homo 


1.24 


2.35 


425417 


AF098948 


Hs.157113 


coenzyme Q, 7 (rat, yeast) homolog 


1.24 


2.88 


434508 


AI648601* 


Hs.1 18012 


ESTs 


1.24 


2.03 


428284 


AA535762 


Hs.183435 


NM_004545:Homo sapiens NADH dehydrogenas 


1.24 


1.59 


418597 


AK001678 


Hs.88337 


similar to DNA-directed RNA polymerase I 


1.24 


2.27 


414191 


AW250089 


Hs.75807 


PDZ and LiM domain 1 (elfin) 


1.24 


1.53 


449210 


A1635363 


Hs.345517 


ESTs 


1.24 


2.18 


439551 


W72062 


Hs.11112 
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hypothetical protein 


0.98 


2.20 


421594 


R45589 


Hs.21889 


Homo sapiens cONA FU12978fis, clone NT 


0.98 


3.09 


444654 


AV650572 


Hs.23440 


WM1105 protein 


0.98 


ZOO 


409759 


N40285 


Hs.81182 


histamine N-methyi transferase 


0.97 


2.20 


401936 






Target Exon 


0.97 


£39 


403463 






Target Exon 


0.96 


258 


434421 


A19 15927 


Hs.34771 


ESTs 


0.96 


2.15 


412636 


NM.004415 




desmopJaWn (DPI, DPI!) 


0.95 


2.01 


442432 


BE093589 


Hs.38178 


hypothetical protein FU23468 


0.93 


£55 


421938 


AA405951 




gb:zu66c01.r1 Scares Jestis.NHT Homo sap 


0.93 


3.10 


447470 


BE618324 


Hs.263561 


ESTs, Weakly similar to A53531 oncofetal 


0.92 


2.08 


448369 


AW26B962 


Hs.111335 


ESTs 


0.91 


2.35 


421710 


AB007930 


Hs.107088 


WAA0461 protein 


0.91 


2.63 


406805 


AI686003 


Hs.296031 


ESTs 


0.91 


£21 


447475 


A1380797 


Hs.1 58992 


ESTs 


0.90 


3.25 


428892 


U82828 


Hs.194382 


ataxia telangiectasia mutated (includes 


0.90 


£02 


450222 


U75308 


Hs.24644 


TATA box binding protein (TBPyassociate 


0.89 


£73 


401572 






C15001384*:gi|12737057|retlXP_012129.1| 


0.88 


£00 


429226 


AA913330 


Hs.53542 


choreoacanthocytosis gene; K1AA0986 prot 


0.88 


£37 


421979 


AW062518 


Hs.233150 


hypothetical protein MGC5560 


0.87 


3.70 


407614 


NMJM1932 


Hs.37144 


membrane protein, patmitoylated 3 (MAGUK 


0.86 


£02 


417912 


R25269 


Hs.50547 


ESTs 


0.86 


£00 


401654 






NM_00724£Homo sapiens DEADiH (Asp-Glu-A 


0.86 


£11 


403149 






NM 001450:Homo sapiens four and a half L 


0.86 


£19 


413000 


BE046280 




gb:hn43c09j(2 NCI CGAP„RDF2 Homo sapiens 


0.85 


£40 


425165 


AK001458 


Hs.154919 


WAA0625 protein 


0.85 


£18 


447371 


AA334274 


Hs.1 8368 


DKFZP564B0769 protein 


0,84 


£13 


452801 


AI935587 


Hs.34447 


ESTs 


0.84 


£55 


400957 






Target Exon 


0.83 


£15 


426420 


BE3838Q8 


Hs.322430 


KDRG family, member 4 


0.83 


£14 


429354 


AA451666 


Hs.269363 


ESTs 


0.80 


£25 


417831 


H15423 


Hs.82685 


CD47 antigen (Riwelated antigen, integr 


0.78 


£43 


443368 


8E568891 


Hs.199210 


ESTs, Moderately similar to bKl 16F5.2 (H 


0.78 


£00 


441901 


AI914445 


Hs.128103 


ESTs 


0.76 


£06 


429462 


A1890356 


Hs.127804 


Homo sapiens, clone IMAGE:3536432, mRNA, 


0.76 


£03 


403010 






C2100015£gij6226483|sp|Q521 1BJYM03.ERWS 


0.75 


£43 


420344 


BE463721 


Hs.97101 


putative 6 protein-coupled receptor 


0.75 


£42 


448332 


AW293110 


Hs.171068 


ESTs 


0.74 


£00 


421674 


T10707 


Hs.296355 


hypothetical protein FU23138 


0.71 


£18 


450845 


AL117441 


Hs.301997 


hypothetical protein FU 13033 


0.67 


£06 


448514 


AB020626 


Hs.301B66 


WAA0819 protein 


0.67 


£03 


431609 


AW792792 


Hs.264330 


M-acytsphmgosina amldohydrolasB (acid c 


0.60 


£54 


417512 


X76534 


Hs.82226 


glycoprotein {transmembrane} nmb 


0.60 


£00 


425960 


AW410646 


Hs.164649 


hypothetical protein DKFZp434H247 


0.57 


£15 


408077 


AL133574 


Hs.42458 


Homo sapiens mRNA; cDNA DKFZp586Cl817 (f 


0.53 


£29 



imm 

Pkey: Unique Eos probeset Identifier number 
CAT number. Gene duster number 
Accession: Genbank accession numbers 



Pkey 
409345 



CAT Number 
112147J 



409540 


1138613.1 


409806 


1155259 1 


409838 


1155987J 


409844 


1156139 1 


409855 


1156256.1 


409865 


1156518 J 


410600 


121 108 J 


410846 


1223902.1 


410946 


1227589.1 


411456 


1246706.1 



Accession 

AJ949109 AW1 18631 AI833148 AW1 17891 R45932 R80970 AA7331 10 AW269428 T2702A R97693 All 49202 AI475492 H98845 AA609665 

A1125598 AW663742 AI140029 AW103164 AA126862 N66829 WS5398 H94509 W65300 R07712 R36955 AA812477 AA609576 A1754304 

Z41075 AA970004 AW274661 AA923584 A1673108 AAO70706 AA541812 T90938 

AW409569 BE297044 BE295828 

AVIOT960AW501280AW500814 

AW502928 AW505606 AW501576 AW501577 

AW502336 AW502339 AW501736 AW501839 

AW502461 AW503000AW502207AW501862 

AW502208 AW502366 AW502148 

AW575742 BE549623 A1335824 BE463447 AA729043 AW408712 AW499616 AA086179 AW499617 AA191322 

AWB07057 AW807054 AW807189 AW807193 AW807369 AW807429 AW807364 AW807365AW807078AW807256AW807180AVV807331 

AW81 1502 AW81 1521 AW811548 AW811471 AW811511 AW811508 

AW847588 AW847716 AW847664 AW847592 

213 



WO 03/003906 



PCI7US02/21338 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



411609 
412210 
412383 
412560 



412636 



412998 
413000 
413611 
413709 
413804 
413895 
415789 
416023 
416272 
417500 
419555 
420685 
421762 
421938 



422984 



432638 

433821 
434767 

434809 
438129 
438713 



1251530.1 
1283515J 
1292509J 
130601.1 



13165J 



439004 
441623 



442473 
442506 
445182 
447703 



134321 8_1 
1343239J 
1380017.1 
'1384144J 
1390710J 
1397743.1 
1555357.1 
156696.1 
158407.1 
168443.1 
185884.1 
195591.1 
206590.1 
209376.1 
22276.1 



223488.3 



35173.4 

374566.1 
39297.1 

393566.1 
450500.1 
463722.1 



438924 4669.1 



467743.1 
52182.1 



543413.1 
54405.1 
632151.1 
733191.1 



AW993680AW853769 
AW901492 AW947725 AW901448 

AW947577 AW947574 AW947576 AW947734 AW947733 AW947732 

R24601 R23657 AA194467 AI948584 AI678666 AA194383 AI765219 AA702993 AA813511 AA620965 A1990303 AI624882 A1003925 
AI33B870 AI004689 AJ004690 Al 127228 Z25302 F29302 BE044308 F32992 AA1 12966 T30825 F24958 F18071 C00537 T30B41 R24502 
AI934786 AI770075 AI144132 AA812597 AW20397B H82735 A1813349 AI142908 C04894 AI208243 AI208044 N89963 AI767866 A1290470 
AIB65963 H8361 1 BE463806 A! 082639 M507760 „ 
NM 004415 AL031058M77830 BE149760AW752599AW848723AW376697 AW376817AW376599 AW848371 AW376782AW848789 
AW361413 AW849074 AW9971 39 AW7993Q4 AW799309 BE077020 BE077017 BE1B5187 AW997196 BE156621 BE179915 BE006561 
BE143155 AW890985 BE0021 07 AW1 03521 AA857316 AW383133 BE011378 AW170253 BE185750 AW886475 BE160433 J05211 
BE082576 BE0825B4 BE004047 AW607238 AW377700 AW377699 BE082526 BE0B2505 BE082507 BE082514 AW1 78000 AW1 77933 
AI905935 AW747877 AW746114 BE148516 AW265328 AW847678 AWB47688 AW365151 AW355148 AW365153 AW365158 AW365175 
AW365157 AW365154 AW058840 BE005272 AW365145 BE001925 BE182166 BE144243 BE001923 AI951766 AW34518 BE1&49M 
BE184933 AI284090 BE184941 AW804674 BE184924 C04715 W39488 AW995615BE184948 BE159646 AW605653 AA099891 AA1311 28 
AA337270 AA340777 A W3 84371 AA85221 2 R58704 AW356566 AW364859 AA025851 AA025852 AA455100 M7 19958 AVW52220 
AW996245 BE165351 BE073467 AA377127 AW890264 AW609750 AW391912 AW849690 T87267 AWB53812 AA852213 W74 1491 BE009090 
AA056401 H91011AVV368529AVV390272C1B457AW674920 N57176 M026480AW576767 H93284 M0268ra 
AW177786 BE092134 BE092137 BE092136 AW177784 A1022862 BE091653 AW376811 AW848592 AA040018 BE185331 BE1B2164 
AA368564 AW951576 T29918 AA131077 W95048 W25458 AW205789 H90899 N29754 W32490 R20904 BE167181 BE167165 N84767 
H27408 H30146 AI190590 C03378 A1554403 A1205263AA 128470 AI392926 AF139065 AW370813 AW370827 AW798417 AW7987M 
AW798883AW798569 R33557 AA1 491 90 C03029AW1 77783 AA088866 AW370829 AA247685 BE002273 A1760816 AI439101 AW879451 
AI700963 M451923 A1340326 AI590975 T48793 A1568096 A1142882 AA039975 AI470146 AA946936 BE067737 BE0677B6 W1ffiB7 
AA644381 AA702424 A1417612 AI305554 AI686869 AI568892 AW190555 AI571075 AI220573 AA056527 A1471874 AI304772 ^517828 
AI915596AI627383AI270345AW021347AW^ 

AI830049 AI8B7258 AW780435 AI910434 A18199B4 A1B58282 A1078449 AI025932 AI860584 AI635878 AA026047 JW03232 D12062 
AW19G0B5 AA658154 AW514597 AW591892 T87181 AA782066 AW243815 AW150038 AW268383 AW004633 AI927207 AA782109 

AI783914 H44405 AW799118 AA1 28330 AA51 5500 AA918281 W02156 AI905927 AA022701 W38382 R20795T77861 AW860878 
BE046254 BE046673 BE046253 
BE046280 BE046763 BE046676 

BE153275 BE153169 BE153329 BE153022 BE153030 BE152974 

BE158687 BE158688 

T64682 BE168190 BE168256 

BE178160 BE177986 BE17B330 BE178480 

H01581 H12B50 R65905 H13053 

AA173029BE467711 AA176710 

AA178882AA179898AA178897 

H59970AA203382 R08822 

AA244416AA244401 

AA279362 AM54496 AA584871 

AA297546AA297410AA297401 AA297465 AA297268AW966174 

BE242131 AA339340AA371380 AA334618 AA316398 

AW572486 N52563 N89687/ 



WM7717 AA34999B AA350286 BE501310 R48207 AI372769 AJ003450 AA915976 A1261513 AI033019 AI222825 AW150983 AA420700 



^1^416AM18195 M778707AA927922 AA868718AA853991 AA609856AA634398 



AF153201 AW888811 Anwu«ivn»™-^ v , vw^.w • . — ----- - , * n^con a 

AI091885 AA206800 AW370684 AA904608 AA806352 AAB94757 AVW68376 AA807284 AA311 341 AW3583U 
AW974687 AA649656 AA652145 

^^!^^f R ^^vn^<i FMRM KM737 AIS38427 AW51714 AWI72603 F02602 F01437 AI684417 A1801950 F02970 Z39367 

iftSmaHO AJB01493 AI873767 A076042 AJ804327 AA813438 AA680002 A1274492 T18177 AI287337 A1935050 AA907805 

AI696593 AI858482 A1556B01 AW072797 AA1 28047 AA035502 AW243274 AA992517 R43760 

A1277548 

W27992AF0S6388 

BE566411AL121194AW976385AW366882AI767324 „ r .^,„«„ 

AW189787 AI215430 AW268499 AW20593O AI392907 BE093017 BE093019 BE093010 
A1420277AW747989W26565 



214 



WO 03/003906 



PCT/US02/21338 



448079 74834.1 



450936 85190.2 
451844 888230.1 
452900 93691.1 



454412 
454555 
454717 
455234 
455424 
455474 



455591 

455604 
455842 
457747 
458991 
459192 



453085 94851.1 



1174764.1 
1223870.1 
1230516.1 
1265385.1 
1289247.1 
1292960.1 



455476 1293055J 



1335166.1 

1337197J 

1374629.1 

397222.1 

850804.1 

923891.1 



R76981 AA101801 AA1 01 722 AA1 22297 AA098802 BE0Q6483 AA343200 AW963375 BE090425 AA213871 BE621402AA309094 AW960171 
AIZ75045 Z44230 AW243724 AI051487 AI376624 R58631 AW978550 C00116 AI984051 AA122265 AI379941 AW205843 AW205839 
AW1 90219 AW129532 A1954133 A1568869 A1383948 A1537386 AA213788 A1083416 A1360826 AA101802 AI619505 AA101795 M101723 
AW614392 R64287 AA927599 A1251904 AI803003 A1675123 AW023079 AW134959 AJ926156 AA831069 AI638324 F29901 R46085 P03383 
C17731 R205B4T90131 

A1033745 A1034133 AA844424 AW166024 AI831699 A1971097 AA011685 H70852 H70851 
T51430 AI820546 Al 821 336 

AA626794 AA626779 AA071274 AA928041 AI954235 N71035 N70230 AW674412 A1871 1 36 AJ563955 AJ954237 AA649543 AI340231 
A1368586 AI86B721 W44485 T83736 AA126250 AI34361 9 H94297 T47633 AI672897 AA49S355 R23240 A1814680 AA9021 1 9 AA644262 
N67040 AW074273 A1357512 AA865354 AI027942 R33837 H95828 N63928 AI41B701 AJ186469 AA693672 AA77B429 AA128352 AW954072 
C0001 5 AA861 B53 AW022016 AI955S45 A17531 18 AI755095 AA029523 T700B6 AA029458 AW675640 N79606 A1659597 AI41 7119 A1804089 
AI383091 AA128493 T839O7T47632AA09411BAA4964O5 H94191 AW887759 N98454 AA512988 A1623761 AW028373AW249740 
AI187029 AA991733 AJ683085 A1336036 DS0550 D60466 D60009 D60248 AW262673 A1524080 AA205643 AI969141 AW674989 M093704 
R81248 R73069 W01019 R23316 C14688 W44435AA093663 T70157 H57883 R94790 D81287 D81788 AA364340 AA383533 W«W 
BE295144 D60973 081466 D60551 R33836 D60249 AA354560 W01933 T71 478 AA51 5461 F27984 BE084745 W19084 R81247 A1080252 

S 6 243aS 2 30 AA412478 AA828434 AA814538 A19274 18 All 92435 W52897 AA443666 AA031913 A1683306 AA918481 A1183314 
D83907 A1206832 AA876122 D83836 D83838 D82533 AI761290 A1191 125 AJ143749 AW771909 AI241436 A1767267 WfMJ MW7787 
AAS68692T10502A1247870M715017M643304AA890233AA811387AAB97470 M9 

AI783713 N80205 W56778 AA676899 AI888718 N69930 A133B935 AI217580 AA639508 AA575836 BE046852 A1312651 W038406 ; AA6afc49 

AA643838 A1493761 AA032024 W38849AA340178AA447052 AA452969 W19369 AA296364 H44229 W58767 C05751 C05835A1741989 

N9B532 AW102617 AA412583 AI922246 W3B495 AA355375 AA928571 C06275 AA352500 N931 32 

AW582568 AW81 8656 AW81 8647 AW818655 AW81 8637 AW818234 

AW807095 AW807022 AW845880 AW807096 AW807461 AW8461 1 6 AW807070 

AW815123 AW815138 AW815259 

R41084AW875856 

awum 51 AWQ480S8 AW948109 AW948105 AW948097 AV^B120AW948108 AW948111 AW948114AW948115 AVV948072AVV948083 
AW948062 



SmsSs) BSfBE A KBE007994 BE008016 BE008019 BE008024 BE008022 BE008027 BE0GB029 BE008020 
BE008015 BE008021 BE008028 BE008023 BE008030 BE008014 
BE01 1 1 83 BE01 1 170 BE01 1 333 BE01 1 1 88 BE01 1 1 81 BE01 1324 BE01 1161 BE011169 
BE145837BE145894 
AW975000 AA658945 AA661 558 

AwS A K AW176203 AW1761B1 AW176213 



TABLE 9C 



Pker Unique number corresponding to an Eos probesel 

Ref: Sequence source. The 7 digit numbers in this column are Genbanfc U »^J^I) mimbere. Dunham L et al 

sequence of human chromosome 22.' Dunham I. et al., Nature "™ 
Strand: Indicates DNA strand from which exons were predicted. 
Nlposition: Indicates nucleotide posiliorts of predicted exons. 



refers to the publication entitled The DNA 



402:48*495. 



Pkey 

400634 

400750 

400752 

400772 

400773 

400788 

400789 

400835 

400843 

400844 

400845 

400846 

400847 

400957 

400967 

401038 

401041 

401058 

401156 

401177 

401196 

401204 

401286 

401346 

401371 



Ref 


Strand 


B567750 


Minus 


8119067 


Plus 


7331445 


Minus 


8131629 


Minus 


8131629 


Minus 


7342055 


Plus 


8307741 


Plus 


8954121 


Plus 


9188605 


Plus 


9188605 


Rus 


9188605 


Plus 


9168605 


Rus 


9188605 


Rus 


7705148 


Minus 


7770682 


Minus 


7232177 


Minus 


7232177 


Plus 


8117654 


Minus 


9438289 


Rus 


9438503 


Minus 


9719673 


Plus 


9743388 


Minus 


9801342 


Minus 


9926605 


Minus 


9650602 


Plus 



NLposifion 

101102-101223,101886-102018 
198991-199168,199316-199548 
3621546461 

34898-35021.4107841197 

44116-44238,4820848321 

184369-184715 

82281-83693 

89368-89622 

586^0.7853^ 

24746-24872,25035-25204 

34428-34812 

3931039474 

4464344835 

66959-67241 

32697-32999 

4277-4469 

4475045076 

4522645414 

31381-31526 

62773-63330 

3313843834 

33694-33872 

147036-147318 

12031-13032 

80901-81283 

215 



WO 03/003906 



401383 


6721135 


Minus 


155543-157381 


401496 


7381769 


Minus 


82790-83002 


401512 


7622346 


Plus 


136399-136557 


401563 


8247910 


Plus 


91395-91763 


401572 


8570271 


Minus 


78651-78889 


401593 


7230957 


Plus 


10368-10572,11293-12356 


401600 


4388746 


Minus 


27363-27518.28727-28891,29526.29731 


401603 


7589963 


Minus 


115659-116780 


401613 


4878082 


Plus 


22461-22B31 


401654 


9097132 


Minus 


64695-64797 


401660 


9100564 


Minus 


173662-174024 


401720 


6468551 


Plus 


7783^468 


401784 


7249190 


Plus 


148362-148606.149453-149535,149731-149962 


401B35 


7139700 


Plus 


142257-142742 


401886 


7229913 


Minus 


79215-79393 


401936 


3308091 


Plus 


46817-46943 


402001 


9501818 


Plus 


68052-68223 


402087 


8117546 


Plus 


137069-137213.138678-138828,138969-139050 


402138 


7704985 


Plus 


14173-15108 


402144 


7242326 


Plus 


115425-115977 


402190 


8576067 


Minus 


76488-76959 


402191 


8576073 


Minus 


69410-69583 


402329 


7798735 


Plus 


15833-16180,18419-18715,22507-22624 


402368 


9558577 


Minus 


47218-47330,48052-48203 


402371 


9558584 


Plus 


68736-68956 


402378 


9625333 


Minus 


41312-41468,48313-48720 


402449 


9796674 


Rus 


59867-60039,62588-62828.53465-53623,64923-66108 


402474 


7547175 


Minus 


53526-53628,55755-55920,57530^7757 


402517 


9798106 


Plus 


17569-17721 


402599 


7239566 


Plus 


5835-5987 


402617 


9930797 


Minus 


69466-69945 


402712 


8969253 


Minus 


10941-11138 


402731 


9211639 


Minus 


117913-118004,121110-121211,121327-121457,125478-125623,126540-126663 


402737 


9212184 


Minus 


13358-13552 


402754 


9213730 


Plus 


15345-15B52 


402760 


9213869 


Plus 


136829-136952,137336-137521 


402812 


6010110 


Plus 


25026-25091,25844-25920 


402845 


9369288 


Plus 


160451-160617,160788-161009 


402869 


6434643 


Minus 


138639-139335 


402892 


8086844 


Minus 


194384-194645 


403010 


3132346 


Plus 


78385-79052 


403149 


9799833 


Plus 


25034-25185 


403326 


8440025 


Minus 


110959-111122 


403329 


8516120 


Plus 


96450-95598 


403342 


7233487 


Minus 


42312-43750 


403349 


8569773 


Minus 


167815-168374 


403381 


9438267 


Minus 


26009-26178 


403463 


9929538 


Plus 


102596-102879 


403655 


8736093 


Plus 


65568-65859 


403725 


7534031 


Rus 


86737-86843 


403728 


7534291 


Minus 


34481-34671 


403807 


8439933 


Minus 


162963-165773 


403826 


9838209 


Rus 


121197-121358 


403885 


7710403 


Minus 


53259-53524 


403943 


7711864 


Plus 


100742-100904,101322-101503 


4039B8 


8576087 


Plus 


16251-16462 


404029 


7671252 


Plus 


108716-111112 


404058 


3548785 


Plus 


99397-101808 


404069 


3168619 


Phis 


4731047450 


404178 


7630978 


Minus 


178075-178383 


404204 


3169112 


Minus 


79868-80321 


404231 


6218035 


Minus 


61077-61322 


404250 


9187145 


Minus 


36099-36212,37928-38075 


404273 


9885189 


Plus 


97789-98285,99601-99855 


404343 


9838093 


Phis 


122664-122931 


404349 


7630858 


Minus 


61006-61187 


404391 


3135305 


Minus 


26030-26173,27852-27997 


404392 


3135305 


Minus 


29738-29857 


404397 


9558608 


Minus 


104042-104232 


404438 


6984205 


Rus 


63413-63553 


404439 


7139680 


Rus 


55316-55585 


404530 


6479107 


Plus 


3157-3304 


404596 


9958262 


Minus 


104807-105043 


404649 


9796926 


Minus 


100027-100399 


404566 


7272179 


Minus 


18677-18993 


404687 


9797554 


Minus 


128456-128565 


404744 


9187237 


Rus 


71776-71852,72885-73019,73700-73822,74692-74850 
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404751 


7630939 


Bus 


113799-114252,114393-114715 


404769 


8099713 


Minus 


175801-176823 


404790 


7230958 


Plus 


38611-38761 


404906 


7331453 


Minus 


100985-101126 


404956 


7387343 


Plus 


55883-56203 


404977 


3738341 


Minus 


4308143229 


405000 


©57544 


Minus 


88854-89993 


405022 


7330304 


Plus 


217163-217439 


405053 


7651944 


Minus 


157134-157430 


405058 


7655685 


Plus 


150740-151556 


405155 


9966228 


Plus 


130469-130723 


405163 


9966267 


Minus 


16117M61299 


405155 


9966302 


Plus 


6461-6845 


405166 


9966302 


Plus 


4052640891 


405172 


9966752 


Plus 


153027-153262 


405180 


7139743 


Plus 


6543W5740 


405193 


7230072 


Plus 


128187-128383 


405194 


7230072 


Plus 


190465-190645,193346-193610 


405245 


7249293 


Minus 


57560-58312 


405365 


2275192 


Minus 


119867-120372,120481-120824,121029-121357 


405376 


1552533 


Plus 


28875-29099 


405418 


6997292 


Plus 


51839-51953 


405436 


740806B 


Minus 


55716-55979 


405443 


7408143 


Plus 


90716-90887,101420-101577 


405474 


8439781 


Pius 


172005-172175 


405502 


9211311 


Minus 


50360-50584 


405580 


4512267 


Pius 


169232-169647 


405595 


7159256 


Plus 


4758S47688 


405630 


4508116 


Minus 


103218-103291.105858-105993,110051-110126 


405646 


4914350 


Plus 


741-969 


405710 


5531256 


Minus 


66203-66832 


405806 


7274891 


Minus 


224961-226760 


405812 


4775630 


Minus 


29424-29764 


405855 


7652031 


Minus 


60377-60795 


405879 


6758747 


Minus 


547B9-55457 


405891 


6758795 


Plus 


41062-41861 


405902 


6758795 


Minus 


82322-83110 


405932 


7767812 


Minus 


123525-123713 


406038 


8389537 


Plus 


37764-37877 


406140 


9168231 


Minus 


49887-50219 


406181 


5923650 


Plus 


16586-16B55 


406231 


7417725 


Plus 


17206-17641,17772-17968 


406248 


7417725 


Plus 


49711-50227 


406274 


7543787 


Plus 


932-1123 


406356 


7107907 


Plus 


18761-18973 


406404 


9256305 


Minus 


152569-152874 


406467 


9795551 


Plus 


182212-182958 


406557 


7711569 


Minus 


5445-5574,6170-6352 


406575 


7711679 


Plus 


142034-142473 
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TABLE 10A: Genes pfeferenliatly expressed in noninvasive bladder tumors 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene .gene title 

BJ 80th percentile of Ta tumor Als divided by the 80th percentile of T2-T4 tumor Ais 



Pkey 


ExAccn 


UnigenelO 


Unigene Title 


R1 


421110 


AJ2507 17 


Hs.1355 


cathepsin E 


8.23 


420001 


AF 196478 


Hs. 188401 


annexin A10 


5.78 


451668 
415511 


243948 


Hs.326444 


cartilage acidic protein 1 


5.53 


AJ732617 


Hs.1 82362 


ESTs 


4.72 


428336 


AA503115 


Hs.183752 


mooseminoprotein, beta- 


4.66 


418026 


BE379727 


Hs.83213 


fatty acid binding protein 4, adipocyte 


4.62 


400752 






NJA_003105 f :Homo sapiens sortilin-related 


3.99 


430315 


NM.004293 


Hs.239147 


guanine deaminase 


3.82 


403010 






C21 000152:gi|6226483|sp|Q521 1 BIYM03_ERWS 


3.56 


404977 






InsuDn-fike growth factor 2 [somatomedi 


3.54 


426657 


tiki A4 PA^i* 

NM_015865 


Hs.171731 


solute carrier family 14 (urea transport 


3.51 


400409 


AF153341 




Homo sapiens winged helix/forichead trans 


3.38 


400844 






NMJ)03105*:Homo sapiens sortilin-related 


3.27 


4vbUol 






Target Exon 


3.22 


Hit £10 


YC5C7Q 
ADOO/O 


Hs.295449 


parv albumin 


3.03 


402230 






Target Exon 


2.96 


454219 


X75042 


Hs.44313 


v-rel avian retfcuJoendolheiiosis viral 


2.89 


403331 






ENSP00000231844*:Ecotropic virus Integra 


2.87 


426088 


AFO380O7 


Hs.166196 


ATPase, Class 1, type 8B, member 1 


285 


452285 


AI358570 


Hs. 123933 


ESTs, WeaWy similar to ZN91JWMAN ZINC 


2.69 


434061 


AW024973 


Hs.283675 


NPD009 protein 


Z66 


418406 


X73501 


Hs.84905 


cytokeratin 20 


2.65 


418818 


AA22B899 


Hs.1 01307 


Homo sapiens HUT1 1 protein mRNA, partial 


Z59 


421594 


R45689 


Hs.21889 


Homo sapiens cDNA FL) 12978 fis, clone NT 


257 


403383 






Target Exon 


2.56 


435583 


AF210317 


Hs.95497 


solute carrier family 2 (facilitated glu 


Z55 


424800 


AL035588 


Hs.1 53203 


MyoD family inhibitor 


2.54 


404606 






Target Exon 


Z53 


418205 


L21715 


Hs.83760 


troponin ), skeletal, fast 


2.53 


431912 


AI660552 


Hs.76549 


ESTs, Weakly similar to A56154 AW subst 


Z52 


413786 


AW613780 


Hs.13500 


ESTs 


Z51 


421100 


AW351839 


Hs.1 24660 


Homo sapiens cDNA: FU21763 fis, clone C 


Z50 


416640 


BE262478 


Hs.79404 


neuron-specific protein 


Z50 


420729 


AW964897 


Hs.290825 


ESTs 


2.50 


402844 






C1000118*:gil9951913|reflNP_062B32.1| pr 


2.48 


401093 




Hs.208067 


C12000586*:giI6330167|dfc5|BAA86477.1! (A 


2.46 


417720 


AA205625 


ESTs 


2.45 


400297 


AI127076 


Hs.306201 


hypothetical protein OKFZp56401278 


Z45 


403818 






Target Exon 


Z44 


440273 


A1805392 


Hs.325335 


Homo sapiens cDNA: FU 23523 fis. done L 


2.44 


418060 


AA211589 


Hs.208047 


ESTs 


Z40 


400843 






NM_003105*:Homo sapiens sortilin-related 


2.38 


446006 


NM.004403 


Ks.13530 


deafness, autosomat dominant 5 


2.35 


401512 






NM»014080:Homo sapiens dual oxtdase-like 


2.34 


446B47 


T51454 


Hs.82845 


Homo sapiens cDNA: FU21930 fis, clone H 


Z32 


417094 


NJVL005895 


Hs.81182 


histamine N-meinyl transferase 


Z31 


436293 


A1601188 


Hs.1 20910 


ESTs 


Z30 


436246 


AW450963 


Hs.1 19991 


ESTs 


Z30 


447578 


AA912347 


Hs.136585 


ESTs. WeaWy similar to JC5314 CDC2BAxfc 


Z29 


417381 


AF164142 


Hs.82042 


sotute carrier ramify 23 (nucieobase tra 


Z28 


426028 


NM001110 


Hs.172028 


a disintegrin and metaDoprotetoase doma 


Z27 


431448 


AL137517 


Hs.306201 


hypothetical protein DKFZp56401278 


Z26 


437181 


A1306615 


Hs.125343 


ESTs, WeaWy similar to KIAA0756 protein 


Z23 


415025 


AW207091 


Hs.72307 


ESTs 


Z18 


412610 


X90908 


Hs.74126 


ratty acid binding protein 6, ileal (gas 


Z04 


424099 


AF071202 


Hs.139336 


ATP-binding cassette, sub-famity C (CFTR 


Z03 


433078 


AW015188 


Hs.1 21575 


Homo sapiens cDNA FU 12231 fis. done MA 


Z01 


416225 


AA577730 


Hs.188684 


ESTs, Weakly similar to PC4259 ferritin 


ZOO 


411880 


AW872477 




gb:hm30f03Jt1 Na,CGAPJhy4 Homo sapiens 


1.99 


452316 


AA29S484 


>is.6126S 


ESTs, Atoderaiely similar to G786_HUMAN P 


1.89 


413804 


T64682 




gb:yc48b02.rl Stratagene Bver (937224) 


1.88 


432306 


Y18207 


Hs.303090 


protein phosphatase 1, regulatory (Inhib 


1.76 


405364 






ENSP00000239138*:Guanine mjcteotide^nd 


" 1.60 


414320 


U13616 


Hs.75893 


ankyrin 3, node of Ranvier (ankyrin G) 


1.52 


401929 






C17001690:gi|6005701|ref|NP-009099.1| AT 


1.00 
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Pkey: Unique Eos probeset Identifier number 
CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT Number Accession 

411880 1263110 1 AW872477 BE088101 T05990 

413804 1390710.1 T54682 BE 168 190 BE 168256 

TABLE 10C 



Pkey: Unique number corresponding to an Eos probeset ... Mn . nM4 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et a!/ refers to the publication entitled The DNA 

sequence of human chromosome 22." Dunham I. et at, Nature (1999) 402:489495. 
Strand: Indicates DNA strand from which exons were predicted. 
NLposiHon: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


400752 


7331445 


Minus 


400843 


9188605 


Plus 


400844 


9168605 


Plus 


401093 


8516137 


Minus 


401512 


7622346 


Plus 


401929 


3810670 


Minus 


402230 


9966312 


Minus 


402844 


9369286 


Plus 


403010 


3132346 


Plus 


403381 


9438267 


Minus 


403383 


9438267 


Minus 


40381 B 


8962065 


Minus 


404606 


9212936 


Minus 


404977 


3738341 


Minus 


405364 


2281075 


Minus 


406081 


9123861 


Minus 



NLposition 
36215-36461 

5863-5970,7653-7784,8892-9023,9673-9807.10634-10789,15254-15403,23827-23958 

24746-24872,25035-25204 

22335-23166 

136399-136557 

3167-3286,4216-4310 

29782-29932 

54958-553*3 

7S385-79052 

26009-26178 

119837-121197 

138360-138512.144656-144796 

22310-23269 

43081-43229 

48325-48491.49136-49252 
38115-38691 
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TABLE 11 A: Genes preferenOally expressed In muscle-invasive bladder tumors 



Pkey. Unique Eos probeset Idenlifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

R1 80th percenBe of T2-T4 tumor Als divided by the 80th percentile of Ta tumor Als 



Pkey ExAccn UnigenelD Unigene Title 

423961 D13666 Hs. 136348 penbsfin (OSF-2os) 

421948 L42583 Hs.334309 keratin 6A 

401780 MM_005557*:Homo sapiens keratin 16 (foca 
446619 AU076643 Hs.313 secreted phosphoproteln 1 (osteopontin, 
444381 BE387335 Hs.283713 ESTs, Weakly sirrtfar to S 64054 hypotheti 
439926 AW014875 Hs.137007 ESTs 

408243 Y00787 Hs.624 Interieukin8 

414183 AW957446 Hs.301711 ESTs 

411573 AB029000 Hs.70823 KIAA1077 protein 

414522 AW518944 Hs.76325 step II spHclng factor SLU7 

413063 AL035737 Hs.75184 chitinase 3-Uke 1 (cartilage glycoprote 

441633 AW958544 Hs.1 12242 normal mucosa of esophagus specific 1 

427337 246223 Hs.176663 Fc fragment of IgG, low affinity lllb, r 

420859 AW468397 Hs. 100000 S 100 calcium-binding protein A8 (calgran 

422168 AA586B94 Hs.1 12408 SlOOcalcium-tanding protein A7(psorias 

418870 AF147204 Hs.89414 ctoernokine (C-X-C motif), receptor 4 (fUs 

401781 Target Exon . 

421116 T19132 Hs.101850 retinol-bindlng protein 1, cellular 

448429 D17408 Hs.21223 calponin 1, basic, smooth muscle 

414020 NM 002984 Hs.75703 small inducible cytokine A4 (homologous 

447526 AL048753 Hs.303649 small Inducible cytokine A2 (monocyte ch 

424247 X140O8 Hs.234734 lysozyme (renal amyloidosis) 

456525 AW468397 Hs.1 00000 S100 calcium-binding protein A8 (calgran 

418007 M13509 Hs.83169 matrix metafloproteinase 1 (interstitial 

406663 U24683 Hs.293441 Immunoglobulin heavy constant mu 

425593 AA278921 Hs.1 908 proteoglycan 1, secretory granule 

433336 AF017986 Hs.31386 secreted frizzled-related protein 2 

425118 AU076611 Hs.1 54672 methylene lelrahydrofolate dehydrogenase 

415994 NM 002923 Hs.78944 regulalor of G-protem signalling 2. 24k 

412326 R07566 Hs.73817 small inducible cytokine A3 (homologous 

422158 L10343 Hs.112341 protease inhibitor 3, skin-derived (SKAL 

446921 AB012113 Hs.1 6530 small Inducible cytokine subfamily A (Cy 

433470 AW960564 transmembrane 4 superfamfly member 1 

417880 BE241595 Hs.82848 selecfin L (lymphocyte adhesion molecule 

446500 U78093 Hs.1 51 54 sushkepeat-containing protein, X chrom 

413324 V0O571 Hs,75294 corticotropfn releasing hormone 

436729 BE621807 transmembrane 4 superfamfly member 1 

450455 AL1 17424 Hs.25035 chlorWe intracellular channel 4 

413731 BE243845 Hs.75511 connective tissue growth factor 

412429 AV650262 Hs.75765 GR02 oncogene 

418283 S79895 Hs.83942 cathepsm K (pycrexjysostosis) 

418299 AA279530 Hs.83968 integrin, beta 2 (antigen CD18 (p95), ly 

420899 NM_001629 Hs.100194 arachidonate 54ip<aygenase-aciivating p 

40028B X06256 Hs.149609 integrin. alpha 5 (fibrcnectin receptor, 

437446 AA788946 Hs.1 01 302 ESTs, Moderately simlar to CA1C RAT COL 

413441 A1929374 Hs.75367 Src-fflce-adapter 

404854 Target Exon 

431319 AA873350 Hs.302232 ESTs 

452432 AW206008 Hs.283378 HoTno sapiens cDMA: FU21778 fis, clone H 

429679 NM 006290 Hs.21 1600 tumor necrosis factor. alpha-Induced pro 

428330 L22524 Hs.2256 matrix melallopfoteinase 7 (matrilysin. 

408380 AF 123050 Hs.44532 diubiquitin 

431103 M57399 Hs.44 pteiotrophm (heparin binding growth fac 

422545 X02761 Hs. 287820 fibronecfinl 

418203 X54942 Hs.83758 COC28 protein kinase 2 

409956 AW103364 Hs.727 InhiWn, beta A (activin A, actrvin AB a 

406687 M31 1 26 matrix melallopfoteinase 1 1 (strometysin 

414359 M62194 Hs.75929 cadherin 11, lype 2, Oacadherin (osteob 

417259 AW903838 Hs.81800 chondroiBn sulfate proteoglycan 2 (vers 

417497 AW402482 Hs.82212 CD53anBgen 

449335 AW150717 Hs.345728 STAT induced STAT inhibitor 3 

445033 AV652402 HsJ2901 mucin 13.epaheMaJtransnTembrane 

427274 NM 005211 Hs.174142 colony stimulating factor 1 receptor, fb 

427527 A1809057 Hs.293441 InmJnoglobuJin heavy constant mu 

409142 AL138877 Hs.50758 SMC4 (structural maintenance of chromoso 

453331 A1240665 ESTs 

428036 AW068302 Homo sapiens mRNA for caWesmon, 3 UTR 

417366 BE185289 Hs.1076 small proUnevrich protein 1B (comifin) 



R1 

11.22 

10.87 

9.16 

140 

7.88 

7.73 

7.54 

7.00 

6.52 

6.42 

6.14 

6.04 

5.66 

5.62 

5.51 

5.48 

5.46 

5.41 

5.41 

5.32 

5.32 

5.27 

5.22 

5.17 

5.08 

4.93 

4.89 

4.74 

4.64 

4.39 

4.30 

4.29 

4.23 

4.22 

4.22 

4.20 

4.18 

4.15 

4.09 

4.00 

4.00 

4.00 

3.97 

3.95 

3.94 

191 

3.81 

3.77 

3.76 

172 

3.58 

358 

157 

3.52 

.149 

148 

141 

136 

132 

130 

125 

123 

123 

3.22 

3.18 

115 

110 

3.09 
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414622 AI752666 Hs.76669 nioolinamldeN-melhyltransferase 

41847B U3894S Hs.1 174 cycEn-dependent kinase Inhibitor 2A (me 

417771 AAB04598 Hs.82547 reUnote acid irceptor responder (lazaro 

413936 AF113676 Hs.297681 serine (or cysteine) proteinase inhibilo 

406755 NB0129 Hs.94360 metalloihtonein 1L 

426653 AA530892 Hs.1 71695 dual specificity phosphatase t 

443623 AA345519 Hs.9641 complement component 1, q subcomponent, 

443907 AU076484 Hs.9963 TYRO protein tyrosine Hnase binding pro 

422048 NNL012445 Hs.288126 spondin 2, extracellular matrix protein 

410204 AJ243425 Hs.326035 early growth response 1 

438973 AW959503 Hs.60440 ESTs, Weakly similar to serin protease w 

420202 AL036557 Hs.95910 putative lymphocyte GOttBI switch gene 

422626 AA344932 Hs.1 18786 metaltothjonein 2A 

442402 NNL000954 Hs.8272 prostaglandin D2 synthase (21kD. brain) 

413902 AU076743 Hs.75613 C036 antigen (collagen type I receptor, 

434868 R50032 Hs.159263 collagen, type VI. alpha 2 

407207 T03651 Hs.336780 tubulin, beta polypeptide 

438855 AW946276 Hs.6441 Homo sapiens mRNA; cDNA DKF2p586J021 (fr 

430413 AWB42182 Hs.241392 small "inducible cytokine A5 (RANTES) 

424909 S78187 Hs.1 53752 cell division cycle 25 B 

419938 AU076772 Hs.1 279 complement component 1,r subcomponent 

416819 U77735 Hs.60205 pim-2 oncogene 

422562 AI962060 Hs.1 18397 AE-binding protein 1 

414081 AW969976 Hs.279009 matrix Gla protein 

426406 AI742501 Hs.1 69756 complement component 1,s subcomponent 

443950 NMJW1425 Hs.9999 epitheBal membrane protein 3 

418323 NMJTO2118 Hs.1 162 major histocompatibility complex, class 

414420 AA043424 Hs.76095 immediate early response 3 

415149 X12451 Hs.78056 calhepsinL 

415213 NM 002933 Hs.78224 ribonuctease. RNase A family, 1 (pancrea 

421848 X15880 Hs.1 08885 collagen, type VI, alpha 1 

452516 AA05B630 Hs.29759 RNA POLYMERASE I AND TRANSCRIPT RELEASE 

427676 AA394062 Hs.300772 tropomyosin 2 (beta) 

415198 AW009480 Hs.943 natural kilter cell transcript 4 

424390 AW815657 Hs.182241 interferon induced transmembrane protein 

426825 AL1 33415 Hs.297753 virnenfin 

452363 AI582743 Hs.94953 Homo sapiens, Similar to complement comp 

407694 U77594 Rs.37682 retinoic acid receptor responder (tazaro 



3.07 
3.02 
2.77 
2.75 
2.75 

^67 

2.65 

2.64 

151 

2.46 

2.46 

244 

2.44 

243 

2.42 

2.42 

2.30 

2.29 

2.20 

2.18 

2.17 

2.11 

2.07 

2.07 

2.03 

2.01 

1.94 

1.90 

1.72 

1.70 

1.69 

1.64 

1.64 

1.60 

1.59 

1.51 

1.46 

1.44 



TABLE HP 

Pkey. Unique Eos probeset identifier number 
CAT number Gene duster number 
Accession: Genbank accession numbers 



a. as* srsssssssrs 



^51813 /W78WhWW7737 T68350 F07712 AA121 145 H08973 AA345212 BE000867 AW068210 AW608407 R0567 H1671 NS5426 
SSmSSSSfW W46881 W9077B N71242 AA534B26 AUM0676 R23797 H96450 
22S«7 StM47 W4^A^2TO AA2SS997 M026215 AW579469 AW365135 AW365134 AW9943S3 AW972886 AW069166 

JS^^ aSb A^7^Sl81 aSb^OOT AW891837 H83985 AW182753 A1422979 AK79733 BE006555AL048166 



AW608074AV^83AJ589944AA665817AW19^ 

iiMi»<nq7 A137M<» C16471 AA652809 AA936687 AA506512 C16306 AW028413 A1537935 AA528347 C16255 AW029046 C16202 

Ss2 Sal A^K? ^m3^2^2KB19 CIKK AA7S9137 AA693351 Z40779 C16577 AA885045 AW073763 R45484 

A1^10AL047737AV659047AV659632AI7^ 

AA653188 AI752459 AA85371 1 AW950663 R78964 R36359 R21626 R21522 

221 
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10 

436729 42585.1 



433470 3872 1 AW950564 AA092457 T55890 D55120T92525 AI815987 BE18260B BE182595 AW080238 M90657 AA34723B AW9S1686 AW176446 

AA304671 AW5B3735 T61714AA316958AI448615AA343532 M083489 M488005 W5209SW39480N57402 D8 
D82729 D5B990 BE619182 AA315188 AA308636 AA1 12474 W76162 AAQ88544 H52265 AA301631 H80982 AA113786 BE620997 AW651691 
AA343799 BE613669 BE547180 BE546656 F1 1933 AA37680Q AW239185 AA376086 BE544387 BE619041 AA452515 AAO018O6 AA190873 
M180483 AA159546 F00242 Al 940609 AI940602 A1189753 T97663 T66110 AW062896 AW062910 AW062902 AI051622 AJ828930 
AA102452 AI685095 AI81 9390 AA557597 AA383220 AI804422 A1633575 AW338147 AW603423 AW506800 AW750557 AW51 0672 M290777 
AA083510 AW829109 AW513200 AA921 353 A1677934 Al 148698 AI955858 AA173825 AA453027 A1027865 AVW75542 AA4540M AA733014 
A1591 384 R79300 R80023 AA843108 AA626058 AA844B98 AW375550 AA889018 A1474275 AW205937 A1O52270 |AW3BB \1 ' AWJ88 JJ 1 
AA699452 AI242230 N47470 H38178 AA366521 AA1 13196 AA130023 H39740 T61629 AI885973 AW083671 AA1 79730 AA305757 AI285455 
N83956 AA216013 AA336155 AW999959 T97525 AA345349 T91762 AA771981 A1285092 AJ591386 BE3924B6 BE385B52 AA682601 
AI682884 AA345840 T85477 AA292949 AA932079 AA098791 D82607 T48574 AW752038 C06300 

BE621807 A1445461 A1346835 AI453743 A1564644 A1928364 AW984527 BE156214 A1694111 AI591358 (^^} t SSS^SL 
AW071625 AI578712 C17732 D57559 H61762 AI720939 AI262930 H27252 AA47971 2 A1927769 AA291465 AA155661 A1963432 ^»7995 
M421678A!925607AA292956AA192448AW192593AJ855838AI69e905AJ871950AI911921 BE619741 BE439796 AJ161312 A^97^1 
15 AI424384AI093510AI240988AW820230AW92554BE0^ 

AA088439 AW806193 AW806183 AA479834 BE501957 AA129574 R3B1 14 AA649494 AA524526 BE327120 AWCT2531 ™™ 8 6 
AW015724 AA043217 AW772000 AI799814 A1671727 AW779725 AA502832 AI470033 AA1 29575 W38161 AI972739 AA4O4570 AA62768B 
AA723200 AA147228 AA903050 AI990245 A1075878 T32487 C061 23 AA1 57944 AI800106 W60075 AI859160 AA478328 AW673152 
AA182640 AI990827 AW275048 AW103470 A1298935 AW471421 R791 90 AW085158 W45410 AI333170 AW30(^6 AAW^17 T55840 
20 AI823466 AI692846 AA962397 AW191997 AA136658 A1251817 BE044134 AW339104 AW517762 AA724739 RJ^J^ViS^l 

AA037696 AA190966 AA757735 AW772283 AA010631 H80983 A1769516 H64985 A1061065 AI950693 1 AA085492 AI245632 ^8594 
AW088968 BE156360 A1349390 A1621320 AI738844 AW1 94272 AA148284 AA953883 C06385 AA487893 A192721 AI91W23 A1453453 
A1798502 A1189366 A1261359 AI032569 AW338678 AI972899 A1500576 AI872628 A1693030 228771 A1985583 AI363829 AVW39301 
AA581093A1S50333 W60032 M603586AI686240AW24295BM 

AI918453 AA03557S AI472527 AW351556 AA191414 AW674145 D57558 AI446740 D57845 AI589264 C05782 AA722206 A1432033 R21752 
BEi™S0AJ458237AW^ 

A1348877 AI798415 D1 1921 A1377596 AI983655 AI744233 C061 1 1 A1248307 AA948565 AI224807 

Sq2F 31981 ^124^026^^5018 R75957 A1803329 R27528 R36203 AI809932 AI808765 R78948 AA411449 AA976929 AI378760 

30 

JV AI350463R31935AW069127 AA411621 R25671 R36105H12451 H03869 H51263AA035486 R25109 R251 10 AA1 47933 



25 

453331 96214J 



TABLE 11C . 



35 2? th-mm*******™™ 

sequence of human chromosome 22/ Dunham I. el al., Nature (1999) 402:489495. 

Strand- Indicates DNA strand from which exons were predicted. 

^ NLposlHon: Indicates nucleotide positions of predicted exons. 

401780 7249190 Mta ffl£^17.2B»^ 

401781 7249190 Zus 8321 WJ3435.B3531 -83656,83740*3901. 84237^4393,B4955^037,8629W6814 
45 404854 7143420 Plus 14260-14537 



222 



WO 03/003906 



PC17US02/21338 



TABLE 12A: Genes preferentially expressed in muscte-frrvasjve Madder tumors 



Pkey: Unique Eos probeset identifier number 

ExAocn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

Seq ID No: Sequence Identification Number Unking Information in Table 12A to sequences In Table 13 



Pkey 


ExAccn 


UnigenelO 


424503 


NNC002205 


Hs.149609 


400289 


X07820 


Hs.2258 


418007 


M13509 


Hs.83169 


418738 


AW388633 


Hs.S6B2 


406964 


M21305 




429276 


AF056085 


Hs.198612 


428227 


AA321649 


Hs.2248 


410102 


AW248508 


Hs.279727 


425247 


NM005940 


Hs.155324 


424735 


U31875 


Hs.272499 


420159 


A1572490 


Hs.99785 


415511 


AI732617 


Hs. 182362 


452461 


N78223 


Hs.108106 


413324 


V00571 


Hs.75294 


443211 


AJ128388 


Hs.143655 


439926 


AW014B75 


Hs.1 37007 


432222 


AI204995 




408908 


8E296227 


Hs.250822 


443171 


BE281128 


Hs.9030 


432829 


W60377 


Hs.57772 


410553 


AW016B24 


Hs.272068 


419183 


U60669 


Hs.89663 


425721 


AC0Q2115 


Hs.159309 


420370 


Y13645 


Hs.97234 


437852 


BE001836 


Hs.256897 


428664 


AK001666 


Ks.189095 


456034 


AW450979 




421110 


AJ250717 


Hs.1355 


451666 


Z43948 


Hs.326444 


408243 


Y00787 


Hs.624 


440304 


BE159984 


Hs.125395 


414918 


AI219207 


HsJ2222 


418867 


031771 


Hs.89404 


426088 


AF038007 


Hs.166196 


405033 






422282 


AF019225 


Hs.1 14309 


425B52 


AK001504 


Hs.159651 


400844 






404875 






422809 


AK001379 


Hs.121028 


431347 


A1133461 


Hs.251664 


413804 


T64682 




444163 


AI126098 




444444 ■ 


AI149332 


Hs.14855 


427747 


AW411425 


Hs.180655 


419741 


NM-007019 


Hs.93002 


420281 


A1623693 


Hs.323494 


402305 






436608 


AA628980 




427239 


BE270447 




413278 


BE563085 


Hs.833 


424008 


R02740 


Hs.137555 


414907 


X90725 


Hs.77597 


421379 


Y15221 


Hs.103982 


426028 




Hs.172028 


417079 


U55590 


Hs.81134 


401093 






422247 


U18244 


Hs.113602 


449722 


BE280074 


Hs.23960 


423673 


BE003054 


Hs.1695 


418478 


U38945 


Hs.1 174 


439606 


W79123 


Hs.58561 


418543 


NWL005329 


Hs.85962 


418678 


NMJJ01327 


Hs.167379 


428182 


BE386042 


Hs.293317 


427335 


AA448542 


Hs.251677 


409420 


215008 


Hs.54451 



Unigene Title Seq ID No. 

Integrin, alpha 5 (fibronectin receptor, Seq ID No. 1 & 2 

matrix mslalloproleinase 10 (stromerysin Seq ID No. 3 & 4 

matrix metaUoprotelnasa 1 (interstitial Seq ID No. 5 & 6 

solute carrier family 7, (cafmic amino Seq ID No. 7 & 8 

FGENES predicted novel secreted protein Seq ID No. 9 & 1 0 

G protein-coupled receptor 51 Seq ID No. 11 & 12 

small inducible cytokine subfamily B (Cy Seq ID No. 13 & 14 

ESTs; homologue of PEM-3 [Qona savignyl Seq ID No. 15 & 16 

matrix metaDoprotBinase 1 1 (stromerysin Seq ID No. 17 & 18 

short-chain alcohol dehydrogenase family Seq ID No. 19 & 20 

Homo sapiens cDNA: RJ21245 fis, clone C Seq ID No. 21 & 22 

ESTs Seq ID No. 23 -25 

transcription factor Seq ID No. 26 4 27 

corticotropin releasing hormone Seq ID No. 28 & 29 

ESTs Seq 10 No. 30 

ESTs Seq ID No. 31 & 32 

gb:an03c03jt1 Stratagene schizo brain S1 Seq ID No. 33 

serine/threonine kinase 15 Seq ID No. 34 & 35 

TONOU Seq ID No. 36 & 37 

ESTs Seq ID No. 38 & 39 

hypothetical protein MGC14128 Seq ID No. 40 & 41 

cytochrome P450, subfamily XXIV (vitamin Seq ID No. 42 & 43 

uroptakinlA Seq ID No. 44 & 45 

urop)akin2 Seq ID No. 46 & 47 

ESTs, Weakly similar to dJ3650121 [H.sa Seq ID No. 48 & 49 

similar to SALL1 (sal (DrosophilaHfce Seq ID No. 50 & 51 

gb:Ul-H-B13-ala-a-1 2-0-U1 ^ 1 NCLCGAP_Su Seq ID No. 52 

cathepsinE Seq ID No. 53 & 54 

cartilage addle protein 1 Seq ID No. 55 - 60 

interteuklnS Seq ID No. 61 & 62 

ESTs SeqiDNo.63&64 

hypothetical protein FU13459 Seq ID No. 65 & 66 

msh (£>osophila) homeo bew homoJog 2 Seq ID No. 67 & 68 

ATPase, Class I, type 88, member 1 Seq ID No. 69 & 70 

C1002652*.-gl|544327|spiQ04799IFMO5J?ABfT Seq ID No. 71 & 72 

apoHpoprolein L Seq ID No. 73 & 74 

death receptor 6, TNF superfamSy member Seq ID No. 75 & 76 

NM_003105*:Homo sapiens sorfiHn-related Seq ID No. 77 & 78 

NM.02281 9*:Homo sapiens phospholipase A2 Seq ID No. 79 & 80 

hypothetical protein FU10549 Seq ID No. 81 & 82 

insulin-tike growth factor 2 (somatomedi Seq ID No. 83 & 84 

gb.7c48b02j1 Stratagene Over (937224) Seq ID Na 85 & 86 

FGENESH predicted RNaseH domaln-containl Seq ID No. 87 - 89 

ESTs Seq IO No. 90 & 91 

serine/threonine kinase 12 Seq ID No. 92 & 93 

ubtquIHn earner protein E2-C Seq ID No. 94 & 95 

Predicted cation efflux pump Seq ID No. 96 & 97 

C19000735-:gil4508027|reflNP_003414.1| z Seq (D No. 98 - 100 

down syndrome critical region protein OS Seq ID No. 101 & 102 

ubtquitin carrier protein Seq ID No. 1 03 & 1 04 

interferorvstimulated protein, 1 5 kDa Seq ID No. 105 & 106 

putative chemokine receptor; GTP-bmdlng Seq ID No. 107 & 108 

poto (DrosophiaJ-fike kinase Seq ID No. 109 & 1 10 

small Inducible cytokine subfamily B (Cy Seq ID No. 1 1 1 & 1 12 

a di$ln\egrm and metatfoproteinase doma Seq ID No. 1 13 & 1 14 

interieukin 1 receptor antagonist Seq ID No. 115 & 1 16 

C1 2000686':gil63301 67fdbj|aAAB6477.1 1 {A Seq ID No. 117 - 119 

solute carrier family 1 (high affinity a Seq ID No. 120 & 1 21 

cyclinBI Seq ID No. 122 & 123 

matrix melalloprotemase 12 (macrophage Seq ID No. 124 & 1 25 

cydin-dependent kinase inhibitor 2A (me Seq 10 No. 126 - 1 33 

G protein-coupled receptor 87 Seq ID No. 134 & 135 

hyaluronan synthase 3 Seq ID No. 136 & 137 

cancerrteslis antigen (NY-ESO-1) Seq ID No. 138 - 141 

ESTs. Weakly sim3ar to GGC INHUMAN G ANT Seq ID No. 142 & 143 

G antigen 7B Seq ID No. 144 & 145 

laminin, gamma 2 (ntcein (lOOkD), kafini Seq ID No. 146 & 1 47 

223 



WO 03/003906 



PCT/US02/21338 



404440 






433091 


T \£Wc 


h5.0IO& 




Anoint 


Hs.44532 




Aw/4/uyu 


Hs.57101 




KHJ fWVJ/G7 


Li- * Mink 

HS. 153704 








A01A7Q 

4ir74/o 


AlCBQIjf] 

AJooJ24J 


Hs.97258 




NMJJ12152 


Hs.258583 


CO 

4J31&y 


AB0358S8 


Hs. 150587 


432441 


AW292425 


Ms. 163484 


425726 


AF085808 


re. 159330 


440U4& 


A IOOTjIOC 

AJ^y/4oo 


Hs.20166 




BE019924 


Hs.271580 


Ann A A 
4J/U44 


AL035854 


Hs.69517 


444381 


8E387335 


Hs. 28 37 13 


400303 


AA242758 


Hs.79136 


452747 


BE153855 


Hs.61460 








428484 


AF104032 


Hs, J 84601 


429211 


AF052693 


Hs.198249 


417389 


BE260964 


Ks.82045 


445537 


AJ245671 


Hs.12644 


423961 


D13666 


Hs.136348 


417433 


BE270265 


Hs.82128 


444781 


N&UM4400 


Hs.11950 


430486 


BE062109 


Ns.241551 


425650 


NW.001944 


Hs.1925 


409103 


AF251237 


Hs.112208 


129404 


AI267700 




403047 






439738 


BE246502 


Hs.9598 



NM_021048:Homo sapiens melanoma antigen, Seq ID No. 148 & 149 

lymphocyte antigen 6 complex, locus D SeqID No. 150 & 151 

dhiblquiUn Seq ID No. 152 & 153 

mmichjomosome maintenance deficient (S. Seq \0 No. 154 & 155 

NIMA (never in mitosis gene abated k SeqID No. 156 & 157 

ESTs Seq ID No. 158 

ESTs, Moderately similar to S29539 ribos Seq ID No. 159 & 160 

EDG-7 (endothelial differentiation, tys Seq ID No. 161 & 162 

ktnesin-tike protein 2 Seq ID Wo. 163 & 164 

ESTs Seq ID No. 165 

uroplaxin3 Seq ID No. 166 & 167 

prostate stem cell antigen Seq ID No. 168 & 1 69 

uroplakinlB Seq ID No. 170 & 171 

differentially expressed In FanconPs an Seq ID No. 172 & 1 73 

ESTs, Weakly similar to S54054 hypothec Seq ID No. 174 & 175 

UV-1 protein, estrogen regulated Seq ID No. 176 & 177 

Ig superfamjry receptor LNIR Seq ID No. 178 & 179 

hypothetical protein DKFZp56401 278 Seq ID No. 180 & 1 81 

solute carrier family 7 (cationic amino Seq ID No. 182 & 183 

gap junction protein, beta 5 (connexin 3 Seq ID No. 1 84 & 185 

midta'ne (neurite growtn-pfomctlag facto Seq ID Not 186 & 187 

EGF-like-domain, multiple 6 Seq ID No. 188 & 189 

periosUn (OSMos) Seq ID No. 190 & 191 

5T4 oncofetal trophohlast glycoprotein Seq ID No. 192 & 193 

GPl-anchored metastasis-associated protein homolog Seq ID No. 194 & 1 95 

chbride channel, calcium activated, family member 2 Seq ID No. 196 & 197 

desmogleln 3 (pemphigus vulgaris antigen) Seq ID No. 198 & 1 99 

XAGE-1 protein Seq ID No. 200 & 201 

ESTs Seq ID No. 202 

NMJJ05656*:Homo sapiens transmembrane protease Seq ID No. 203 & 204 

sema domain, immunoglobulin domain (Ig), Iransmem Seq ID No. 205 & 206 



TABLE \JB 



Pkey: Unique Eos probeset identifier number 
CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT Number Accession 

413804 1390710.1 T64682 BE168190 BE168256 

427239 27647.1 BE270447 AW409921 BE207288 BE207170 D56355 BE263223 BE408171 BE262243 BE392439 BE292738 BE281776 BE314300 BE267719 

BE268715 BE513876 BE295291 BE297066 AA210923 BE407519 H51344 BE622905 AW248281 AW250313 T19021 AA355115 AA316879 
BE269633 BE621936 AA290724 AA380031 AA993616 BE301331 W79220 H73020 AW797850 BE27461 1 BE314315 BE542407 BE562063 
BE3t 2866 BE27881 7 BE252503 AA426470 M91670 NM_0 14501 BE264533 A1884863 AI884942 AI678077 AA829937 AI869333 AW273060 
AA906924 AJ571 293 AW172642 AA582906 AW674698 AW300936 AW615753 AW615587 AW665495 AJ859296 AA594105 AA9281 10 
AJ038859 AI589898 AA828316 AA938955 AA283711 AW170665 BE047759 AW732128 A10^5067 AW70748 AW248955 AA975490 
AA21 1097 Al 826097 AA826730 AA994072 A1367867 W74508 AI086474 AA827720 AI922589 AW000888 W58033 AA991 461 BE048131 
M91 218 AAB761 88 AA661759 AI1 23879 AI089508 AW272915 AI565244 AW245061 AA688299 AW250988 AI953468 AI891 144 AW089131 
A1471577 AA293354 AA464019 BE044549 T29587 AW956171 BE537716 BE269400 AA918328 AI538087 AA969243 AW075033 BE1 39361 
AI340589 AI250128 A124703B BE138953 AW075177 AI307208 BE049086 AW302327 A1054335 AI345565 A1334881 AI252075 AI254494 
AW075006 AW302733 AW302738 AI054057 AI054217 AI054172 A1054302 Al 053722 AJ054060 A1054079 AW075181 A1307473 A1312145 
A1340734 AI334909 AW071374 BE138502 AW074809 AW301901 AJ251662 AI307559 BE 139228 AI254764 AW073049 A1251 264 A1802837 
AW271 B67 A1307442 AW075100 AW073456 AW072496 AI270787 AW271039 AW071307 A1610913 AW071 289 A1251232 AI251262 
AW073656 AW072901 A1307493 AI255068 AI251 289 AI252160 AI271496 A1252427 AW073469 AW071420 AI270156 A1252926 A1252839 
AI252868 AW072520 AW073433 A1340643 AI802854 AJ334733 AI5B3896 AW071 31 1 AI802853 AI28971 1 A1345036 AW072513 AI348921 
AI307478 AI305762 BE 139315 AW271034 A1334886 A1340619AW470478 BE139260 8E138965 AW302085 BE041872 AW268964 

432222 343347 1 A1204995 AW827539 AW969908 AW440776 AA528756 

436608 42361 3 AA628980 AI126603 BE504035 

438817 46559il AI023799AA826307AI683094 A1307373A1870547 AW979007 
444163 59365BJ AI126098AJ1 84746 A1148521 

456034 142696 1 AW450979 AA1 36653 AA1 36656 AW419381 AA984358 AA492073 BE1 68945 AA809054 AW238038 BE01 1212 BE01 1359 BE011367 
BE01 1368 BE01 1 362 BE01 1 21 5 BE01 1365 BE01 1363 



TABLE 12C 

Pkey: Unique number corresponding to an Eos probeset 

Reft Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers, "Dunham L et al." refers to the publication entitled The DNA 

sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 
Ntpositbn: Indicates nudeoOde positions of predicted exons. 



Pkey Ref Strand NLposition 

400844 9188605 Plus 24746-24872,25035-25204 

401093 8516137 tonus 22335-23166 
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WO 03/003906 



PCT/US02/21338 



402305 7328724 Plus 

404440 7528051 Plus 

404875 9801324 Plus 

405033 7107731 Minus 



4083241362 
80430-81681 

96588-96732,97722-97831 
142358-142546 



225 



WO 03/003906 



PCT/US02/21338 



Table 13 

Seq ID NO: 1 DNA sequence; 

Nucleic Acid Accession It: NM_002205 

Coding sequence: 24.. 3173 



1 11 21 31 41 51 

I I I I I 

CAGGACAGGG AAGAGCGGGC GCTATGGGGA GCCGGACGCC AGAGTCCCCT 
10 TGCAGCTGCG CTGGGGCCCC CGGCGCCGAC CCCCGCTCGT GCCGCTGCTG 
TGCCGCCGCC ACCCAGGGTC GGGGGCTTCA ACTTAGACGC GGAGGCCCCA 
CGGGGCCCCC GGGCTCCTTC TTCGGATTCT CAGTGGAGTT TTACCGGCCG 
GGGTCAGTGT GCTGGTGGGA GCACCCAAGG CTAATACCAG CCAGCCAGGA 
GTGGTGCTGT CTACCTCTGT CCTTGGGGTG CCAGCCCCAC ACAGTGCACC 
15 TTGACAGCAA AGGCTCTCGG CTCCTGGAGT CCTCACTGTC CAGCTCAGAG 
CTGTGGAGTA CAAGTCCTTG CAGTGGTTCG GGGCAACAGT TCGAGCCCAT 
TCTTGGCATG CGCTCCACTG TACAGCTGGC GCACAGAGAA GGAGCCACTG 
TGGGCACCTG CTACCTCTCC ACAGATAACT TCACCCGAAT TCTGGAGTAT 
GCTCAGATTT CAGCTGGGCA GCAGGACAGG GTTACTGCCA AGGAGGCTTC 
20 TCACCAAGAC TGGCCGTGTG GTTTTAGGTG GACCAGGAAG CTATTTCTGG 
TCCTGTCTGC CACTCAGGAG CAGATTGCAG AATCTTATTA CCCCGAGTAC 
TGGTTCAGGG GCAGCTGCAG ACTCGCCAGG CCAGTTCCAT CTATGATGAC 
GATACTCTGT GGCTGTTGGT GAATTCAGTG GTGATGACAC AGAAGACTTT 
TGCCCAAAGG GAACCTCACT TACGGCTATG TCACCATCCT TAATGGCTCA 
25 CCCTCTACAA CTTCTCAGGG GAACAGATGG CCTCCTACTT TGGCTATGCA 
CAGACGTCAA TGGGGACGGG CTGGATGACT TGCTGGTGGG GGCACCCCTG 
GGACCCCTGA CGGGCGGCCT CAGGAGGTGG GCAGGGTCTA CGTCTACCTG 
CCGGCATAGA GCCCACGCCC ACCCTTACCC TCACTGGCCA TGATGAGTTT 
GCAGCTCCTT GACCCCCCTG GGGGACCTGG ACCAGGATGG CTACAATGAT 
30 GGGCTCCCTT TGGTGGGGAG ACCCAGCAGG GAGTAGTGTT TGTATTTCCT 
GAGGGCTGGG CTCTAAGCCT TCCCAGGTTC TGCAGCCCCT GTGGGCAGCC 
CAGACTTCTT TGGCTCTGCC CTTCGAGGAG GCCGAGACCT GGATGGCAAT 
ATCTGATTGT GGGGTCCTTT GGTGTGGACA AGGCTGTGGT ATACAGGGGC 
TGTCCGCTAG TGCCTCCCTC ACCATCTTCC CCGCCATGTT CAACCCAGAG 
35 GCAGCTTAGA GGGGAACCCT GTGGCCTGCA TCAACCTTAG CTTCTGCCTC 

GAAAACACGT TGCTGACTCC ATTGGTTTCA CAGTGGAACT TCAGCTGGAC 
AGAAGGGAGG GGTACGGCGG GCACTGTTCC TGGCCTCCAG GCAGGCAACC 
CCCTGCTCAT CCAGAATGGG GCTCGAGAGG ATTGCAGAGA GATGAAGATC 
ACGAGTCAGA ATTTCGAGAC AAACTCTCGC CGATTCACAT CGCTCTCAAC 
40 ACCCCCAAGC CCCAGTGGAC AGCCACGGCC TCAGGCCAGC CCTACATTAT 
GCCGGATAGA GGACAAGGCT CAGATCTTGC TGGACTGTGG AGAAGACAAC 
CTGACCTGCA GCTGGAAGTG TTTGGGGAGC AGAACCATGT GTACCTGGGT 
CCCTGAACCT CACTTTCCAT GCCCAGAATG TGGGTGAGGG TGGCGCCTAT 
TTCGGGTCAC CGCCCCTCCA GAGGCTGAGT ACTCAGGACT CGTCAGACAC 
45 TCTCCAGCCT GAGCTGTGAC TACTTTGCCG TGAACCAGAG CCGCCTGCTG 
TGGGCAACCC CATGAAGGCA GGAGCCAGTC TGTGGGGTGG CCTTCGGTTT 
ATCTCCGGGA CACTAAGAAA ACCATCCAGT TTGACTTCCA GATCCTCAGC 
ACAACTCGCA AAGCGACGTG GTTTCCTTTC GGCTCTCCGT GGAGGCTCAG 
CCCTGAACGG TGTCTCCAAG CCTGAGGCAG TGCTATTCCC AGTAAGCGAC 
50 GAGACCAGCC TCAGAAGGAG GAGGACCTGG GACCTGCTGT CCACCATGTC 
TCAACCAAGG CCCCAGCTCC ATTAGCCAGG GTGTGCTGGA ACTCAGCTGT 
TGGAAGGTCA GCAGCTCCTA TATGTGACCA GAGTTACGGG ACTCAACTGC 
ACCCCATTAA CCCAAAGGGC CTGGAGTTGG ATCCCGAGGG TTCCCTGCAC 
AACGGGAAGC TCCAAGCCGC AGCTCTGCTT CCTCGGGACC TCAGATCCTG 
55 AGGCTGAGTG TTTCAGGCTG CGCTGTGAGC TCGGGCCCCT GCACCAACAA 
GTCTGCAGTT GCATTTCCGA GTCTGGGCCA AGACTTTCTT GCAGCGGGAG 
TTAGCCTGCA GTGTGAGGCT GTGTACAAAG CCCTGAAGAT GCCCTACCGA 
GGCAGCTGCC CCAAAAAGAG CGTCAGGTGG CCACAGCTGT GCAATGGACC 
GCAGCTATGG CGTCCCACTG TGGATCATCA TCCTAGCCAT CCTGTTTGGC 
60 TAGGTCTACT CATCTACATC CTCTACAAGC TTGGATTCTT CAAACGCTCC 
GCACCGCCAT GGAAAAAGCT CAGCTCAAGC CTCCAGCCAC CTCTGATGCC 
CAATTTCAGA CTCCCATTCC TGAAGAACCA GTCCCCCCAC CCTCATTCTA 
GGGGTCTGGG TACTTCTTGA AGGTGCTGAC GGCCAGGGAG AAGCTCCTCT 
GAGACATACT TGAAGGGCCA GAGCCAGGGG GGTGAGGAGC TGGGGATCCC 
65 GCACTGTGAA GGACCCTTGT TTACACATAC CCTCTTCATG GATGGGGGAA 
GGGACAGAGG CCCAGCCTCC CTGAAGCCTT TGCATTTTGG AGAGTTTCCT 
GAAAGATAAC TAGGAAATCC ATTCACAGTT CTTTGGGCCA GACATGCCAC 
TGTCCAGCTC CAACCTGCAA AGATCTGTCC TCAGCCTTGC CAGAGATCCA 
CCAGTAAGAA CCTGGAACTT GGGGAGTTAA GACCTGGCAG CTCTGGACAG 
70 GTGGGCCAAC AAAGAACACT AACTATGCAT GGTGCCCCAG GACCAGCTCA 
CACAAGGATA GATGCTGGCC CAGGGCCAGA GCCCAGCTCC AAGGGGAATC 
TGGGGCCAGA TCCAGCCTGG GGTCTGGAGT TGATCTGGAA CCCAGACTCA 
CCAATCCAGG CAGATCCAGG ACTATATTTG GGCCTGCTCC AGACCTGATC 
AGTTCACCCT GATTTAGGAG AAGCCAGGAA TTTCCCAGGA CCTGAAGGGG 
75 AACAGATCTG GAACCTCAGC CTGGCCAGAC ACAGGCCCTC CCTGTTCCCC 

GAGCCCACTG TCCTGGGCCT GCAGAATTTG GGTTCTGCCT GCCAGCTGCA 
CCCTCATCTC TCTGCCCAAC CCTTCCCTCA CCTTGGCACC AGACACCCAG 
AACTCTGTTG CAAGTGCAAT AAATCTGACC CAGTGCCCCC ACTGACCAGA 



1 

CTCCACGCCG 
TTGCTGCTCG 
GCAGTACTCT 
GGAACAGACG 
GTGCTGCAGG 
CCCATTGAAT 
GGAGAGGAGC 
GGCTCCTCCA 
AGCGACCCCG 
GCACCCTGCC 
AGTGCCGAGT 
CAAGGCCAGA 
CTGATCAACC 
AGCTACCTAG 
GTTGCTGGTG 
GACATTCGAT 
GTGGCCGCCA 
CTCATGGATC 
CAGCACCCAG 
GGCCGATTTG 
GTGGCCATCG 
GGGGGCCCAG 
AGCCACACCC 
GGATATCCTG 
CGCCCCATCG 
GAGCGGAGCT 
AATGCTTCTG 
TGGCAGAAGC 
CTGACCCAGA 
TACCTCAGGA 
TTCTCCTTGG 
CAGAGCAAGA 
ATCTGTGTGC 
GACAAGAATG 
GAGGCTGAGC 
CCAGGGAACT 
GTGTGTGACC 
ACAGTCCCTC 
AAGAATCTCA 
GCCCAGGTCA 
TGGCATCCCC 
TATGAGCTCA 
CCCCAGGCTC 
ACCACCAATC 
CACCAGCAAA 
AAATGCCCGG 
GAGAGCCAAA 
CACCAGCCAT 
ATCCTGCCTC 
AAGGCAGAAG 
CTCCTGCTCC 
CTCCCATATG 
TGAGTCCTCC 
CTGAAAAGGA 
CCCCAGCCCA 
TCCCCCCCAT 
CTCAGATCCA 
GAAACAACTG 
AAGGACTTCC 
AAAGAAGCCC 
CCCCACCCTG 
GGACAGATGC 
AGAACTCAAA 
GACATTGGCA 
CTGGAGGCCC 
CCATGATGGC 
AGAGAAAGGG 
CTGATGCTGC 
GACTTATTTA 
ACTAGAAAAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3 840 
3900 
3960 
4020 
4080 
4140 
4200 
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Seq ID NO: 2 Protein sequence; 
5 Protein Accession 8: NP_002196.1 

1 11 21 31 41 51 

I I I I I 

MGSRTPESPL HAVQLRWGPR RRPPLVPLLL LLVPPPPRVG GFNLDAEAPA VLSGPPGSFF 60 

10 GFSVEFYRPG TDGVSVLVGA PKANTSQPGV LQGGAVYLCP WGASPTQCTP IEFDSKGSRL 120 

LESSLSSSEG EEPVEYKSLQ WFGATVRAHG SSILACAPLY SWRTEKEPLS DPVGTCYLST 180 

DNFTRILEYA PCRSDFSWAA GOGYCQGGFS AEFTKTGRW LGGPGSYFWQ GQILSATQEQ 240 

IAESYYPEYL INLVQGQLQT RQASSIYDDS YLGYSVAVGE FSGDDTEDFV AGVPKGNLTY 300 

GYVTILNGSD IRSLYNFSGE QMASYPGYAV AATDVNGDGL DDULVGAPLL MDRTPDGRPQ 360 

15 EVGRVYVYLQ HPAGIEPTPT LTLTGHDEFG RFGSSLTPLG DLDQDGYNDV AIGAPFGGET 420 

QQGWFVFPG GPGGLGSKPS QVLQPLWAAS HTPDFFGSAL RGGRDLDGNG YPDLIVGSFG 480 

VDKAWYRGR PIVSASASLT IFPAMFNPEE RSCSLEGNPV ACINIiSFCLN ASGKHVADSI 540 

GFTVELQLDW QKQKGGVRRA LFLASRQATL TQTLLIQNGA REDCREMKIY LRNESEFRDK 600 

LSPIHIALNF SLDPQAPVDS KGLRPALHYQ SKSRIEDKAQ ILLDCGEDNI CVPDLQLEVF 660 

20 GEQNHVYLGD KNALNLTFHA QNVGEGGAYE AELRVTAPPE AEYSGLVRHP GNFSSLSCDY 720 

FAVNQSRLLV CDLGNPMKAG ASLWGGLRFT VPHLRDTKKT IQFDFQILSK NIiNNSQSDW 780 

SFRLSVEAQA QVTLNGVSKP EAVLFPVSDW HPRDQPQKEE DLGPAVHHVY ELINQGPSSI 840 

SQGVLELSCP QALEGQQLLY VTRVTGLNCT TNHPINPKGL ELDPEGSLHK QQKREAPSRS 900 

SASSGPQIIiK CPEAECFRLR CELGPLHQQE SQSLQLHFRV WAKTFLQREH QPFSLQCEAV 960 

25 YKALKMPYRI LPRQLPQKER QVATAVQWTK AEGSYGVPLW IIILAILFGL LLI^LLIYIL 1020 
YKLGFFKRSL PYGTAMEKAQ LKPPATSDA 

Seq ID NO: 3 DNA sequence: 
Nucleic Acid Accession »: NM_002425 
30 Coding sequence: 23.. 1453 



1 11 21 31 41 51 

35 AAAGAAGGTA AGGGCAGTGA GAATGATGCA TCTTGCATTC CTTGTGCTGT TGTGTCTGCC 60 

AGTCTGCTCT GCCTATCCTC TGAGTGGGGC AGCAAAAGAG GAGGACTCCA ACAAGGATCT 120 

TGCCCAGCAA TACCTAGAAA AGTACTACAA CCTCGAAAAG GATGTGAAAC AGTTTAGAAG 180 

AAAGGACAGT AATCTCATTG TTAAAAAAAT CCAAGGAATG CAGAAGTTCC TTGGGTTGGA 240 

GGTGACAGGG AAGCTAGACA CTGACACTCT GGAGGTGATG CGCAAGCCCA GGTGTGGAGT 300 

40 TCCTGACGTT GGTCACTTCA GCTCCTTTCC TGGCATGCCG AAGTGGAGGA AAACCCACCT 360 

TACATACAGG ATTGTGAATT ATACACCAGA TTTGCCAAGA GATGCTGTTG ATTCTGCCAT 420 

TGAGAAAGCT CTGAAAGTCT GGGAAGAGGT GACTCCACTC ACATTCTCCA GGCTGTATGA 480 

AGGAGAGGCT GATATAATGA TCTCTTTCGC AGTTAAAGAA CATGGAGACT TTTACTCTTT 540 

TGATGGCCCA GGACACAGTT TGGCTCATGC CTACCCACCT GGACCTGGGC TTTATGGAGA 600 

45 TATTCACTTT GATGATGATG AAAAATGGAC AGAAGATGCA TCAGGCACCA ATTTATTCCT 660 

CGTTGCTGCT CATGAACTTG GCCACTCCCT GGGGCTCTTT CACTCAGCCA ACACTGAAGC 720 

TTTGATGTAC CCACTCTACA ACTCATTCAC AGAGCTCGCC CAGTTCCGCC TTTCGCAAGA 780 

TGATGTGAAT GGCATTCAGT CTCTCTACGG ACCTCCCCCT GCCTCTACTG AGGAACCCCT 840 

GGTGCCCACA AAATCTGTTC CTTCGGGATC TGAGATGCCA GCCAAGTGTG ATCCTGCTTT 900 



50 GTCCTTCGAT GCCATCAGCA CTCTGAGGGG AGAATATCTG TTCTTTAAAG ACAGATATTT 960 

TTGGCGAAGA TCCCACTGGA ACCCTGAACC TGAATTTCAT TTGATTTCTG CATTTTGGCC 1020 

CTCTCTTCCA TCATATTTGG ATGCTGCATA TGAAGTTAAC AGCAGGGACA CCOTTTTAT 1080 

TTTTAAAGGA AATGAGTTCT GGGCCATCAG AGGAAATGAG GTACAAGCAG GTTATCCAAG 1140 



1200 
1260 
1320 



AGGCATCCAT ACCCTGGGTT TTCCTCCAAC CATAAGGAAA ATTGATGCAG CTGTTTCTGA 
55 CAAGGAAAAG AAGAAAACAT ACTTCTTTGC AGCGGACAAA TACTGGAGAT TTGATGAAAA 
TAGCCAGTCC ATGGAGCAAG GCTTCCCTAG ACTAATAGCT GATGACTTTC CAGGAGTTGA 

GCCTAAGGTT GATGCTGTAT TACAGGCATT TGGATTTTTC TACTTCTTCA GTGGATCATC 1380 

ACAGTTTGAG TTTGACCCCA ATCCCAGGAT GGTGACACAC ATATTAAAGA GrTAACAGCTG 1440 

GTTACATTGC TAGGCGAGAT AGGGGGAAGA CAGATATGGG TGTTTTTAAT AAATCTAATA 1500 

60 ATTATTCATC TAATGTATTA IX^CCAAAA TGGTTAATTT TTCCTGCATG ^CTGTCACT 1560 

GAAGAAGATG AGCCTTGCAG ATATCTGCAT GTGTCATGAA GAATGTTTCT GGAATTCTTC 1620 

ACTTGCTTTT GAATTGCACT GAACAGAATT AAGAAATACT CATGTGCAAT AGGTGAGAGA 1680 

ATGTATTTTC ATAGATGTGT TATTACTTCC TCAATAAAAA GTTTTATTTT GGGCCTGTTC 1740 

65 m 

Seq ID NO: 4 protein sequence: 
Protein Accession #: NPJ>02416 

70 i ii 2i " j 1 f 

MMHLAFLVLL CLPVCSAYPL SGAAKEEDSN KDtAQQYLBK YYNLEKDVKQ ^R!^ " 

KKIQGMQKPL GLEVTGKLDT DTLBVMRKPR CGVPDVCHFS SFPGMPKHRK THWYRIVKY 120 

TP PJjP RDAVD SAIEKAUCVW EBVTPLTPSR LYECEADIMI SFAVKBHGOF VSFBGPGHSL 180 

75 AHAYPPGPGL YGDIHPDDDE KHTEDASGTO LPLVAAHELO HSLGLFHSAN TEALMYPLTO 240 

SPTELAQFRL SQDDVNGIQS LYGPPPASTE BPLVPTKSVP SGSEMPAKCD PALSFDMST 300 

LRGEYLPFKD RYFWRRSHWN PBPEFHLISA FWPSLPSYLD AAYEVNSRDT VFIPKG.TCFW 360 

AIRGNEVQAG YPRGIHTLGP PPTIRKIDAA VSDKEKKKTY PPAADKYWRF DENSQSMEQG 420 
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FPRLIADDFP GVEPKVDAVL QAFGFFYFFS GSSQFEFDPN ARMVTHILKS NSWLHC 

Seq ID NO: 5 DNA sequence: 

Nucleic Acid Accession fl: NM_00242l 

Coding sequence : 72 1481 



11 



GGGATATTGG 
CAAAGGCCAG 
CTCACAGCTT 
ACCTGGAAAA 
GTGGCCCAGT 
GGAAACCAGA 
TGGCTCAGTT 
GGATTGAAAA 
CCTTCCAACT 
CAGACATCAT 
CTGGAGGAAA 
TTGATGAAGA 
CTCATGAACT 
ACCCTAGCTA 
AAGCCATATA 
CGTGTGACAG 
TTAAAGACAG 
TTTCTGTTTT 
GAGATGAAGT 
TACACGGATA 
TCGATGCTGC 
ACTGGAGGTA 
ATGACTTTCC 
ATTTCTTTCA 
TCCAGAAAGC 
AAAACACATG 
CATTTTTAAC 
CATATHTTT 
ATGGTGGGTA 
GATCTTTTCC 
ATATCCTTTC 
CAAGAACACA 
ACAAAATAAG 



AGTAGCAAGA 
TATGCACAGC 
CCCAGCGACT 
ATACTACAAC 
GGTTGAAAAA 
TGCTGAAACC 
TGTCCTCACT 
TTACACGCCA 
CTGGAGTAAT 
GATATCTTTT 
TCTTGCTCAT 
TGAAAGGTGG 
CGGCCATTCT 
CACCTTCAGT 
TGGACGTTCC 
TAAGCTAACC 
ATTCTACATG 
CTGGCCACAA 
CCGGTTTTTC 
CCCCAAGGAC 
TCTTTCTGAG 
TGATGAATAT 
TGGAATTGGC 
TGGAACAAGA 
TAATAGCTGG 
GTGTGAGTCC 
CTCTAGAGTC 
TACTATTTAG 
CAAAAAGTCA 
AGAGTATGCA 
AAGACAGAAA 
TGTGCAGTCA 
TGTTTTATGT 



21 
I 

GGCTGGGAAG 
TTTCCTCCAC 
CTAGAAACAC 
CTGAAGAATG 
TTGAAGCAAA 
CTGAAGGTGA 
GAGGGGAACC 
GATTTGCCAA 
GTCACACCTC 
GTCAGGGGAG 
GCTTTTCAAC 
ACCAACAATT 
CTTGGACTCT 
GGTGATGTTC 
CAAAATCCTG 
TTTGATGCTA 
CGCACAAATC 
CTGCCAAATG 
AAAGGGAATA 
ATCTACAGCT 
GAAAACACTG 
AAACGATCTA 
CACAAAGTTG 
CAATACAAAT 
TTCAACTGCA 
AAAGAAGGTG 
ACTGATACAC 
AATGTAGCCC 
AGTTTGTGGC 
ACTCTGACGT 
GAGACAGGAG 
CTGGTGTCAC 
TTGGAATAAA 



31 



CCATCACTTA 
TGCTGCTGCT 
AAGAGCAAGA 
ATGGGAGGCA 
TGCAGGAATT 
TGAAGCAGCC 
CTCGCTGGGA 
GAGCAGATGT 
TGACATTCAC 
ATCATCGGGA 
CAGGCCCAGG 
TCAGAGAGTA 
CCCATTCTAC 
AGCTAGCTCA 
TCCAGCCCAT 
TAACTACGAT 
CCTTCTACCC 
GGCTTGAAGC 
AGTACTGGGC 
CCTTTGGCTT 
GAAAAACCTA 
TGGATCCAGG 
ATGCAGTTTT 
TTGATCCTAA 
GGAAAAATTG 
TTTTCCTGAA 
AGAATATAAT 
TTTTTGTACT 
TTATGGATTC 
TGATCCCAGA 
ACATGAGTCT 
CCTGGATAGG 
GTCAACCTTG 



41 
I 

CCTTGCACTG 
GCTGTTCTGG 
TGTGGACTTA 
AGTTGAAAAG 
CTTTGGGCTG 
CAGATGTGGA 
GCAAACACAT 
GGACCATGCC 
CAAGGTCTCT 
CAACTCTCCT 
TATTGGAGGG 
CAACTTACAT 
TGATATCGGG 
GGATGACATT 
CGGCCCACAA 
TCGGGGAGAA 
GGAAGTTGAG 
TGCTTACGAA 
TGTTCAGGGA 
CCCTAGAACT 
CTTCTTTGTT 
TTATCCCAAA 
CATGAAAGAT 
AACGAAGAGA 
AACATTACTA 
GAACTGTCTA 
CTTATTTATA 
GATATAATTT 
ATATAGGCCA 
GAGCAGCTTC 
TTGCOGGAGG 
CAAGGGATAA 
TTTCTACTGT 



51 
I 

AGAAAGAAGA 
GGTGTGGTGT 
GTCCAGAAAT 
CGGAGAAATA 
AAAGTGACTG 
GTGCCTGATG 
CTGACCTACA 
ATTGAGAAAG 
GAGGGTCAAG 
TTTGATGGAC 
GATGCTCATT 
CGTGTTGCGG 
GCTTTGATGT 
GATGGCATCC 
ACCCCAAAAG 
GTGATGTTCT 
CTCAATTTCA 
TTTGCCGACA 
CAGAATGTGC 
GTGAAGCATA 
GCTAACAAAT 
ATGATAGCAC 
GGATTTTTCT 
ATTTTGACTC 
ATTTGAATGG 
TTTTCTCAGT 
CCTCAGTTTG 
AGTTCCACAA 
GAGTTGCAAA 
AGTGACAAAC 
AAAAGCAGCT 
CTCTTCTAAC 
TTT 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
10B0 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 



Seq ID NO: 6 Protein sequence: 
Protein Accession #: NP_002412 



MHSFPPLLLI* 
VBKLKQMQEF 
YTPDLPRADV 
LAHAFQPGPG 
TFSGDVQLAQ 
FYMRTNPFYP 
PKDIYSSFGF 
GIGHKVDAVP 



11 
I 

LFWGWSHSF 
FGLKVTGKPD 
DHAIBKAFQL 
IGGDAHFDED 
DDIDGIQAIY 
EVELNPISVP 
PRTVKHIDAA 
MKDGFFYTFH 



21 
I 

PATLETQEQD 
AETLKVMKQP 
WSNVTPLTFT 
ERWTNNFREY 
GRSQNPVQPI 
WPQLPNGLEA 
LSEENTGKTY 
GTRQYKFDPK 



31 
I 

VDLVQKYLEK 
RCGVPDVAQP 
KVSEGQADIM 
NLHRVAAHEL 
GPQTPKACDS 
AYEFADRDEV 
FFVANKYWRY 
TKRILTLQKA 



41 
I 

YYNLKNDGRQ 
VLTEGNPRWE 
ISFVRGDHRD 
GHSLGLSHST 
KLTFDAITTI 
RFFKGNKYWA 
DEYKRSMDPG 
NSWPNCRKN 



51 
I 

VEKRRNSGPV 
QTHLTYRIEN 
NSPFDGPGGN 
DIGALMYPSY 
RGEVMFFKDR 
VQGQNVLHGY 
YPKMIAHDFP 



Seq ID NO: 7 DNA sequence: 

Nucleic Acid Accession #: NM_01433l 

Coding sequence: 1..1506 



60 
120 
180 
240 
300 
360 
420 



ATGGTCAGAA 
AACGGGAGGC 
AAGAGGAAAG 
GGAATCTTCA 
ACCATCTGGA 
GGAACAACTA 
TTACCAGCTT 
GTGATATCCC 
CCTGAACTTG 
AGCATGAGTG 
GCAATTCTGA 
TTTAAAGACG 



11 
i 

AGCCTGTTGT 
TGCCTTCCCT 
TCACTTTACT 
TCTCTCCTAA 
CGGTGTGTGG 
TAAAGAAATC 
TTGTACGAGT 
TGGCATTTGG 
CGATCAAGCT 
TCAGCTGGAG 
TAATTATAGT 
OGTTTTCAGG 



21 
I 

GTCCACCATC 
GGGCAACAAG 
GAGGGGAGTC 
GGGCGTGCTC 
GGTCCTGTCA 
TGGAGGTCAT 
CTGGGTGGAA 
ACGCTACATT 
CATTACAGCT 
CGCCCGGATC 
CCCTGGAGTT 
AAGAGATTCA 



31 
I 

TCCAAAGGAG 
GAGCCACCTO 
TCCATTATCA 
CAGAACACGG 
CTATTTGGAG 
TACACATATA 
CTCCTCATAA 
CTGGAACCAT 
GTGGGCATAA 
CAGATTTTCT 
ATGCAGCTAA 
AGTATTAOGC 



41 

I 

GTTACCTGCA 
GGCAGGAGAA 
TTGGCACCAT 
GCAGCGTGGG 
CTTTGTCTTA 
TTTTGGAAGT 
TACGCCCTGC 
TTTTTATTCA 
CTGTAGTGAT 
TAACCTTTTG 
TTAAAGGTCA 
GGTTGCCACT 



51 
I 

GGGAAATGTT 
AGTGCAGCTG 
CATTGGAGCA 
CATGTCTCTG 
TGCTGAATTG 
CTTTGGTCCA 
AGCTACTGCT 
ATGTGAAATC 
GGTCCTAAAT 
CAAGCTCACA 
AACGCAGAAC 
GGCTTTTTAT 



60 
120 
180 
240 
300 
360 
420 
480 
S40 
600 
660 
720 
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5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 



TATGGAATGT 
AACCCTGAAA 
TATGTGCTGA 
AATGCAGTGG 
ATCTTTGTTG 
TTATTCTATG 
CGCAAGCACA 
TTCTCTGGAG 
GGGCTGGCAG 
TTCAAGGTGC 
CTTTCCCTCT 
GTCCCTGCGT 
TCAGAGAAAA 
TTATGAACTA 
TTTTTACTTC 
CAGTTATTTT 
AACTCTATGT 
GTCTCTGATA 
TCTCTACAAC 
TATATATGGG 
TTTTCAATTC 
ATTTTACATT 
GCTTTAATGG 
TTAAAGAAGA 
AAAAATCCTT 
TTATCTGTCA 
AGCAAGAGTT 
TACCCCTGAT 
TGAGAGAAAT 
CTACATGCAA 
TGAATTTTGA 
CTTCAGATGA 
AAGAAATGTC 
ATCTAGGCTT 
CTGATAAGAA 
GTTTTGCCAG 
GCACTTTGGG 
CAACATGGAG 
GCTGGTAATC 
GAGGTTGCAG 
CCATCTCCAA 



ATGCATATGC 
AAACCATTCC 
CAAATGTGGC 
CAGTGACCTT 
CCCTCTCCTG 
TTGCGTCTCG 
CTCCTCTACC 
ACCTOGACAG 
TTGCTGGGCT 
CACTGTTCAT 
ATTCGGACCC 
ATTATCTCTT 
TAACCAGAAC 
ATGGACTTGA 
ATTTTCTGAA 
TATTCATATA 
AGTTATAGAA 
CCTACCTATT 
ATATGTTAGC 
TTTTGTAAAG 
TGAAAAAAAG 
GACATTGCAT 
ATTATACCCA 
GTTTCTAGGG 
GAGAATTTAT 
TTTTTTTTTT 
AGTTTGGTAT 
GAGTCTATCT 
AACCAACAAA 
TGTTAGTAAT 
CAGTTTGTGC 
AACTGTCCAG 
GCTGTAAATA 
TGTCAGTAAT 
GAAAATTGAA 
TATTAGAAAA 
AGGCTGAGGG 
AAACCCCATC 
TCAGCTATTG 
TGAGCCAAGA 
AAAAAAAAAA 



TGGCTGGTTT 
CCTTGCAATA 
CTACTTTACG 
TTCTGAGCGG 
CTTTGGCTCC 
AGAGGGTCAC 
AGCTGTTATT 
TCTTTTGAAT 
GATTTATCTT 
CCCAGCTTTG 
ATTTAGTACA 
TATTATATGG 
ATTACAAATA 
GATCTTGGCA 
AGTCTAGAGA 
TTTTAGCATA 
AGTGAATATG 
GGGGTTAGGA 
ACGGCAAAGA 
ATGGTTTTAC 
CATACATCAT 
TGCTTCCCCT 
GAGCACTTTG 
GCTACTGTTT 
TATGTCAGAT 
TCACATCAGT 
TAAATCCTCA 
AAACATATGC 
GAAGATGTTC 
TCTGAAGTTT 
ATTTTCTTTA 
ATTAATTAGG 
AGATTTACAA 
TTCCACACCT 
ATGAGAATCT 
TACTGTGAGC 
GGTGGATCAC 
TCTACTAAAA 
AGGAGGCTGA 
rTGCACCACT 



TACCTCAACT 
TGTATATCCA 
ACCATTAATG 
CTACTGGGAA 
ATGAACGGTG 
CTTCCAGAAA 
GTTTTGCACC 
TTCCTCAGTT 
CGATACAAAT 
TTTTCCTTCA 
GGGATTGGCT 
GACAAGAAAC 
ATACTGGAAG 
ATCTGCCCAA 
ATTACAACTT 
TTCGAACTAA 
CAGTTATTCT 
GAAAAGACTA 
ACCTTCAAAT 
ACACTACAGA 
GATTATGGCA 
TAGATACCAA 
AACAAAGGTC 
ATGAGACACA 
GTTTTTTCAT 
TTGATCAGGA 
TTAGAACAAC 
ATTTTAAGCC 
AAAATAATAG 
TTTAAATTTA 
TACATTTTAT 
AAAAGGCATA 
CTGATGTTTC 
TAATTATCAT 
GTGGATAAGT 
CGGGCATGGT 
CTGAGGTCGG 
ATACAAAATT 
GGCAGGAGAA 
GTACTCCAGC 



TTGTTACTGA 
TGGCCATTGT 
CTGAGGAGCT 
ATTTCTCATT 
GTGTGTTTGC 
TCCTCTCCAT 
CTTTGACAAT 
TTGCCAGGTG 
GCCCAGATAT 
CATGCCTCTT 
TCGTCATCAC 
CCAGGTGGTT 
TTGTACCAGA 
GGGGAGACAC 
TGGTGATAAA 
TTTCTAAGAA 
ATGAGTCGCA 
GACAATTACT 
TGAAGACTGA 
TGTCTATACT 
AAGAGGAGAG 
TTTAGATAAC 
AGTGGGGATT 
TCCAGGAGTT 
TCATTATCAG 
AAGTGTATAA 
CACCTGTTTC 
TTCAAATTAC 
TCCCATATCT 
TGGCTATTTT 
ATTCTTCTGT 
TATTAACATA 
TAGAAAATTT 
TCAACTTGCA 
GTTTGTGTTC 
GGCTTACATC 
GAGTTCTAGA 
AGCTGGGCAT 
TTGCTTGAAC 
CTGGGTGACA 



AGAAGTAGAA 
CACCATTGGC 
GCTGCTTTCA 
AGCAGTTCCG 
TGTCTCCAGG 
GATTCATGTC 
GATAATGCTC 
GCTTTTTATT 
GCATCGTCCT 
CATGGTTGCC 
TCTGACTGGA 
TAGAATAATG 
AGAAGATAAG 
AAAATAGGGA 
CAAAAGGAGT 
ATTTAGTTAT 
CAATTCTTGA 
ATGTGGTCAT 
GATTTTTCTG 
GTGAAAAGTG 
AAAGAAATTT 
AAACACTCAT 
GTTGAATACA 
ATGTTTAAGT 
GAAGTTTTAG 
CACATCTTAG 
ACTAATAACT 
ATTATCAACA 
GTAATCATAT 
TACACGATGA 
TAAAATATCT 
AAAATTGCAA 
CCACTTCTAT 
AAAGAGACAA 
AGAAGATGTT 
TGTAATCCCA 
CCAGCCTGAC 
GGTGGCACAT 
CCGGGAGGCG 
AAGTCAGACT 



780 
B40 
900 
960 
1020 
1060 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 



Seq ID NO: 8 Protein sequence; 
Protein Accession &: NPJJS5146 



1 
I 

MVRKPWSTI 
GIFISPKGVL 
LPAFVRVWVE 
SMSVSWSARI 
YGMYAYAGWF 
NAVAVTFSER 
RKHTPLPAVI 
FKVPLFIPAL 
SEKITRTLQI 



11 
I 

SKGGYLQGNV 
QNTGSVGMSL 
LLIIRPAATA 
QIFLTFCKLT 
YLNFVTEEVE 
LLGNFSLAVP 
VLHPLTMIML 
FSFTCLFMVA 
ILEWPEEDK 



21 
I 

NGRLPSLGNK 
TIWTVCGVLS 
VISLAFGRYI 
AILIIIVPGV 
KPBKTIPIiAI 
IFVAliSCFGS 
FSGDU>SI»LN 
LSLYSDPFST 
L 



31 
I 

EPPGQEKVQL 
LFGALSYAEL 
LEPFFIQCEI 
MQLIKGQTQN 
CISMAIVTIG 
MNGGVFAVSR 
FLSFARWLFI 
GIGFVITLTG 



41 

I 

KRKVTLLRGV 
GTTIKKSGGH 
PBLAIKLITA 
FKDAFSGRDS 
YVLTNVAYFT 
LFYVASRBGH 
GLAVAGLIYL 
VPAYYLFIIW 



51 
I 

SIIIGTIIGA 
YTYILBVFGP 
VGITWMVLN 
SITRLPLAFY 
TINAEBXiLLS 
LPBILSMIHV 
RYKCPDMHRP 
DKKPRWFRIM 



60 
120 
180 
240 
300 
360 
420 
480 



Seq ID NO: 9 DNA sequence 
Nucleic Acid Accession *h FGENESH 
Coding sequence: 1..546 



ATGGCCTTGG 
TTCATGGCTG 
TTGGTGCAAA 
CAGCTCGCCG 
GGCTTTGACT 
TTCTGGTCCT 
CATTCTCCTC 
CAGTTTGTCA 
AAACCCTTTG 
AAGTAG 



11 
I 

GCAGCTCCGC 
GCATTAAGTG 
GGCTCCTGGG 
GTGCCCTCGA 
CCGTGTTGGC 
CAGGAGACAT 
TGAGCACTCC 
AAGGCCAGAA 
AAACTGGTTC 



21 
I 

CCCTGTGGCT 
TCTGTGGCTT 
TGGAGCTCGA 
CCTGCCCGCT 
CTCTCTGCCG 
GTCTGACTGG 
AGGGTGGAGC 
CTTGGACGTA 
CATGGTTCCA 



31 
I 

TTGCAGGGTA 
TTCCAGGTAG 
ACTGAAACTC 
GGGTCCTGTG 
TGGATTTTAA 
GACTACTGGG 
AGGCAGGTTA 
ACAGTGTACT 
TTCACCTTCA 



41 
I 

ATGCCCACTT 
TCCCCCTGGG 
GCTTTGTGCC 
CCTTTGAAGA 
ATGAGGAAGG 
TTGGCTGGCG 
GGCTCCAGTT 
GCAGGCTCCA 
TGTACTGGAT 



51 
I 

CCCTGCTGCT 
GCTCCCCGAG 
CGCAGCCCTG 
GAGCACTTGC 
CCAGCAACCT 
GAAGTTAATT 
GTTCCAGCTT 
GGGCAGTGAG 
CCACCATGGA 
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Seq ID NO: 10 Protein Sequence 
Protein Accession 8: FGENESH 



229 



WO 03/003906 



PCI7US02/21338 



1 XI 21 31 41 51 

I I I I I I 

MALGSSAPVA LGGNAHFPAA FMAGIKCLWL FQWPLGLPE LVQRLLGGAR TETRFVPAAL 60 

5 QLAGALDLPA GSCAFEESTC GFDSVLASLP WILNEEGQQP FWSSGDMSDW DYWVGWRKLI 120 

HSPbSTPGWS RQVRLQLFQL QFVKGQNLDV TVYCRLQGSE KPFETGSMVP FTFMYWIHHG 180 
K 

Seq ID NO: 11 DNA sequence 
10 Nucleic Acid Accession g: EOS sequence 
Coding sequence: 461-3286 

1 11 21 31 41 51 

I I 1 I 1 I 

15 GCCGCCCCTG AGCAGCCTCG CCTTCGCCTC CCGCGTTTCC TGCCGTCCGC CCTCCCCCGG 60 

CCGAGCTCCA GGGGCTGCCG CCTAGCAGCT CCCGGCGGGA GAGCGGTTCA GAGCGCGCAC 120 

GGGGGCGGGC GGAGGCGGCC CGGTGCGGGG CGGCCGCGCT GGAGAGAGGC GCGCGCGGAG 1B0 

ACGCCGGCCC CCCTCTCCGC GTTTGCTCGC TTGCTCCCCG CCTCCCGCAC TCCGCTCGCT 240 

CCCACCCCTT CCCGGCGTGA TTGATCCGTC ACGGGCGCCG CCGCTGCCGC CGCCGCCGCC 300 

20 GCGGCCGTTC TGAGCCGAGC CGGAACCCTA GCCCGAGACG GAGCCGGGGC CCGGGCCGGC 360 

GCCATTGCGC GGGCGCCGCG GGAAGACCTT GGCGCGGGGC GGCGGGCCGG GCCAGGCCAT 420 

GCGGGCCGAG TGAGCCGGCG CCCGCAGCCC GCGGCGOGGC ATGGCTTCCC CGCGGAGCTC 480 

CGGGCAGCCC GGGCCGCCGC CGCCGCCGCC ACCGCCGCCC GCGCGCCTGC TACTGCTACT 540 

GCTGCTGCCG CTGCTGCTGC CTCTGGCGCC CGGGGCCTGG GGCTGGGCGC GGGGCGCCCC 600 

25 CCGGCCGCCG CCCAGCAGCC CGCCGCTCTC CATCATGGGC CTCATGCCGC TCACCAAGGA 660 

GGTGGCCAAG GGCAGCATCG GGCGCGGTGT GCTCCCCGCC GTGGAACTGG CCATCGAGCA 720 

GATCCGCAAC GAGTCACTCC TGCGCCCCTA CTTCCTCGAC CTGCGGCTCT ATGACACGGA 780 

GTGCGACAAC GCAAAAGGGT TGAAAGCCTT CTACGATGCA ATAAAATACG GGCCTAACCA 840 

CTTGATGGTG TTTGGAGGCG TCTGTCCATC CGTCACATCC ATCATTGCAG AGTCCCTCCA 900 

30 AGGCTGGAAT CTGGTGCAGC TTTCTTTTGC TGCAACCACG CCTGTTCTAG CCGATAAGAA 960 

AAAATACCCT TATTTCTTTC GGACOGTCCC ATCAGACAAT GCGGTGAATC CAGCCATTCT 1020 

GAAGTTGCTC AAGCACTACC AGTGGAAGCG CGTGGGCACG CTGACGCAAG ACGTTCAGAG ID BO 

GTTCTCTGAG GTGCGGAATG ACCTGACTGG AGTTCTGTAT GGCGAGGACA TTGAGATTTC 1140 

AGACACCGAG AGCTTCTCCA ACGATCCCTG TACCAGTGTC AAAAAGCTGA AGGGGAATGA 1200 

35 TGTGCGGATC ATCCTTGGCC AGTTTGACCA GAATATGGCA GCAAAAGTGT TCTGTTGTGC 1260 

ATACGAGGAG AACATGTATG GTAGTAAATA TCAGTGGATC ATTCCGGGCT GGTACGAGCC 1320 

TTCTTGGTGG GAGCAGGTGC ACACGGAAGC CAACTCATCC CGCTGCCTCC GGAAGAATCT 13 B0 

GCTTGCTGCC ATGGAGGGCT ACATTGGCGT GGATTTCGAG CCCCTGAGCT CCAAGCAGAT 1440 

CAAGACCATC TCAGGAAAGA CTCCACAGCA GTATGAGAGA GAGTACAACA ACAAGCGGTC 1500 

40 AGGCGTGGGG CCCAGCAAGT TCCACGGGTA OGCCTACGAT GGCATCTGGG TCATCGCCAA 1560 

GACACTGCAG AGGGCCATGG AGACACTGCA TGCCAGCAGC CGGCACCAGC GGATCCAGGA 1620 

CTTCAACTAC ACGGACCACA CGCTGGGCAG GATCATCCTC AATGCCATGA ACGAGACCAA 1680 

CTTCTTCGGG GTCACGGGTC AAGTTGTATT CCGGAATGGG GAGAGAATGG GGACCATTAA 1740 

ATTTACTCAA TTTCAAGACA GCAGGGAGGT GAAGGTGGGA GAGTACAACG CTGTGGCCGA 1800 

45 CACACTGGAG ATCATCAATG ACACCATCAG GTTCCAAGGA TCCGAACCAC CAAAAGACAA 1860 

GACCATCATC CTGGAGCAGC TGCGGAAGAT CTCCCTACCT CTCTACAGCA TCCTCTCTGC 1920 

CCTCACCATC CTCGGGATGA TCATGGCCAG TGCTTTTCTC TTCTTCAACA TCAAGAACCG 1980 

GAATCAGAAG CTCATAAAGA TGTCGAGTCC ATACATGAAC AACCTTATCA TCCTTGGAGG 2040 

GATGCTCTCC TATGCTTCCA TATTTCTCTT TGGCCTTGAT GGATCCTTTG TCTCTGAAAA 2100 

50 GACCTTTGAA ACACTTTGCA CCGTCAGGAC CTGGATTCTC ACCGTGGGCT ACACGACCGC 2160 

TTTTGGGGCC ATGTTTGCAA AGACCTGGAG AGTCCACGCC ATCTTCAAAA ATGTGAAAAT 2220 

GAAGAAGAAG ATCATCAAGG ACCAGAAACT GCTTGTGATC GTGGGGGGCA TGCTGCTGAT 2280 

CGACCTGTGT ATCCTGATCT GCTGGCAGGC TGTGGACCCC CTGCGAAGGA CAGTGGAGAA 2340 

GTACAGCATG GAGCCGGACC CAGCAGGACG GGATATCTCC ATCCGCCCTC TCCTGGAGCA 2400 

55 CTGTGAGAAC ACCCATATGA CCATCTGGCT TGGCATCGTC TATGCCTACA AGGGACTTCT 2460 

CATGTTGTTC GGTTGTTTCT TAGCTTGGGA GACCCGCAAC GTCAGCATCC CCGCACTCAA 2520 

CGACAGCAAG TACATCGGGA TGAGTGTCTA CAACGTGGGG ATCATGTGCA TCATCGGGGC 2580 

CGCTGTCTCC TTCCTGACCC GGGACCAGCC CAATGTGCAG TTCTGCATCG TGGCTCTGGT 2640 

CATCATCTTC TGCAGCACCA TCACCCTCTG CCTGGTATTC GTGCCGAAGC TCATCACCCT 2700 

60 GAGAACAAAC CCAGATGCAG CAACGCAGAA CAGGCGATTC CAGTTCACTC AGAATCAGAA 2760 

GAAAGAAGAT TCTAAAACGT CCACCTCGGT CACCAGTGTG AACCAAGCCA GCACATCCCG 2820 

CCTGGAGGGC CTACAGTCAG AAAACCATCG CCTGCGAATG AAGATCACAG AGCTGGATAA 2080 

AGACTTGGAA GAGGTCACCA TGCAGCTGCA GGACACACCA GAAAAGACCA CCTACATTAA 2940 

ACAGAACCAC TACCAAGAGC TCAATGACAT CCTCAACCTG GGAAACTTCA CTGAGAGCAC 3000 

65 AGATGGAGGA AAGGCCATTT TAAAAAATCA CCTCGATCAA AATCCCCAGC TACAGTGGAA 3060 

CACAACAGAG CCCTCTCGAA CATGCAAAGA TCCTATAGAA GATATAAACT CTCCAGAACA 3120 

CATCCAGCGT CGGCTGTCCC TCCAGCTCCC CATCCTCCAC CACGCCTACC TCCCATCCAT 3180 

CGGAGGCGTG GACGCCAGCT GTGTCAGCCC CTGCGTCAGC CCCACCGCCA GCCCCCGCCA 3240 

CAGACATGTG CCACCCTCCT TCCGAGTCAT GGTCTCGGGC CTGTAAGGGT GGGAGGCCTG 3300 

70 GGCCCGGGGC CTCCCCCGTG ACAGAACCAC ACTGGGCAGA GGGGTCTGCT GCAGAAACAC 3360 

TGTCGGCTCT GGCTGCGGAG AAGCTGGGCA CCATGGCTGG CCTCTCAGGA CCACTCGGAT 3420 

GGCACTCAGG TGGACAGGAC GGGGCAGGGG GAGACTTGGC ACCTGACCTC GAGCCTTATT 3480 

TGTGAAGTCC TTATTTCTTC ACAAAGAAGA GGAACGGAAA TGGGACGTCT TCCTTAACAT 3540 
CTGCAAACAA GGAGGCGCTG GGATATCAAA CTTGCAAAAA AAAAAAAAAA AAAAAAAAAA 3600 

75 CAAAAAAACT AGACAAGGAG AGAGGCACTA GAACTCCAGC TGGAAGTCAC GGAGTGGCTC 3660 

GAGCAGCCTT GGGAAGAGGC AAGGAGCTTC TGAAGAAACT GCCTCTGCAC ACACATCACT 3720 
GGCTGTGACC CCTCAGGCTA GCCCTTCTCC ACTCTGGGGG AGGAGGTGGG AAGGGCCACC 3780 
AGGCCCCCAG CTGCCAGGCC AGCTGACCCC AGCCTTCCTG GAACAGGGAG TCTGCAGGAG 3840 
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CGCAGACAGG CACAGCCCTG GAGCAGGCAG GCCGAGGGCT GCGGCACTGG AGCAGGCTGA 3900 

CTTACATGCT CCACATGGGA CCTGTGTCAC CCAATGAGAT GTTTGTTACT CTGGTAAATG 3960 

CCACACGTTA ACACAATAAC ACCCATTCCT GGGACCGTGG GGATTTAGGG CACGTCACTG 4020 

CAGACACGCT CTGCAGCATT CACCGACAGT CTGTCATGCA CCCACCACGT TGGCCATGTC 4080 

5 CTTGTGTTCC TATCGGATGC TCCCAGTAAC CAGGGGGACC ACCOGAGCTA ATCATGGAAT 4140 

GTCTGTTCCC AGCAAACACG ATAAAGAAAG ATTGTGCACT TTAACCTCTC TCATCAGGGC 4200 

CCAAGGGCTG GCTGGGATTT TTTTTTTTTT TTTCCCACTA ACTTTGTTTC TGACCAAAGT 4260 

GAATTGGAGG CACTCTGCTA AAAGACATCC CCGTAGACAT AGGGGAGAGA GTTGCTGGCT 4320 

GAGGGCTTCC CTTGGCTTCC AGAAGGCAGC CTTCCATCCA GACAAGCCAG TGAGCTCTCC 4380 

10 CCTTGGGATC ACTGGGGTGA TCAGTCAGCA GATTGATTCT CATTCATAAG ATCATTCCTC 4440 

CCTTTAAATT GAGCCCCTAA GAGCACTGGC CTGGGAGTCA GACAGACCTG GGTTCAAGTC 4500 

CTCAGTCCCC TGCCCACTCC CTACGTGACT TTGATCAGGT CACTAGTGTC TCTCTGAGCC 4560 

TCAGTTTCCC CTCTGTAACT TGGGGTTGAA CTAAAACACC TGTCCTGCCT ACCTCACAAG 4620 

GTCACTCTGA GGATTGAAAC TTGATCTTGT CCAGGAAAGC TTTGTACCAA ACAGTGAAGC 4680 

15 CGCCCTGATC CGTGAGGTAT GAGTATGACT CTGACCTTCA GCCCTCCCTA CAGCCGGGGG 4740 

TGTGGCCCAG AGAAGCTTCC AGCACAGCCC TCTACCCAGA ACATCCGGGC TGGAGGGAGG 4800 

CTCCCAGTGA CTTTTCTGAC ATTCCTAGAC AGGTTCATTC TTTGCTCAAG AAAGGCCTGA 4860 

ATGACAATGT CCAGGATGTC TGCACAACTG AGCAGCTCGC TCACTCCCTA AAGAAACCTA 4920 

TTGGCAGCTT CAACAGGCAG GCAATAATCT CTTCCCAGAA CCACTGCAGT CAGGAATAAA 4980 

20 CTGTTTTCTC CACCAGGCTT TGACAAAAGG GCCCACAGGA ATCTTACCAA TGCCAACATT 5040 

TCAAAGCACC CTATTTCACG TAGCATAGCT TTCTGCTCCC CTTCCCCAAA GAGAGGTTAT 5100 

GGAGGTACTG TAGCTTTTAG GGAAAAAAAA ATGTTAACAC ATCACAGGTC AAGTTGAAGT 5160 

CATTCTCTGT TTAGGCACTA AAAATCGGTG TTGTCACTCA CTGTGTATTA CCAGTATTTA 5220 

CTTGCTTTCT TGATTTCACC AAAACCAAAT TTAATTTAAA GGACCACATT AATTTTTCAA 5280 

25 AGGGAAAGAG ACAATTAATT GTACATAATG TATACACACA CACAAAAAAA AATACCTGTA 5340 

GAAATATTAT TCCAGCATAG CAGGAAAACA AACAAAAGTA TTGGACTGTC GGAGGTGAGC 5400 

CTGTGCGTCT GTAACCCTTT GTGACTCCTG AGCGTGCGCT GTCTTCTAGG TTAACTCACG 5460 

AAGTACATTC TCTGTCTTAC TGATACTGTA GGTTCACCCA TTTTTTTTTA ATTTCCTCGC 5520 

AAATAACAAG ACCCACAGAA GTGACTCTAG CTACTTAATG GTTCTGTTCT TTTATATGCA 5580 

30 GCAAACACAC CGTCCATTTC TGAAGAGGCT TCGGCCTGAA GGCATTTTCC AATGATGTTA 5640 

GTGCACAAAA CGCTTTAAAT TAGACTGGAA CTGCCAGAAT CAAATGTAAA TGAGGAATTT 5700 

CTCGTACCCC TACTGCATGG TATCGATTTT TAATAAATTG TTGCAAATTT GTTTTTATGA 5760 
ATAAAAGGAA AAAACCTGTC GTCTTT 

35 Seq ID NO: 12 Protein sequence: 

Protein Accession #: EOS sequence 

1 11 21 31 41 51 

40 MASPRSSGQP GPPPPPPPPP ARLLLLLLLP LLLPLAPGAW GWARGAPRPP PSSPPLSIMG 
LMPLTKEVAK GSIGRGVLPA VELAIEQIRN ESLLRPYFLD LRLYDTECDN AKGLKAFYDA 
IKYGPNHIHV FGGVCPSVTS IIAESLQGWN LVQLSFAATT PVLADKKKYP YFPRTVPSDN 
AVNPAILKLL KHYQWKRVGT LTQDVQRFSE VRNDLTGVLY GEDIEISDTB SPSNDPCTSV 
KKLKGNDVRI ILGQFDQNMA AKVFCCAYEE NMYGSKYQWI IPGWYEPSWW EQVHTEANSS 

45 RCLRKNLLAA MEGYIGVDFE PLSSKQIKTI SGKTPQQYER BYNNKRSGVG PSKFHGYAYD 
GIWVIAKTLQ RAMETLHASS RHQRIQDFNY TDHTLGRIIL NAMNETNFFG VTGQWFRNG 
ERKGTIKFTQ FQVSREVKVG EYNAVADTLE IINDTIRFQG SEPPKDKTII LEQLRKISLP 
LYSILSALTI LGMIMASAFL FFNIKNRNQK LIKMSSPYMN NLIILGGMLS YASIFLFGLD 
GSFVSEKTFE TLCTVRTVJIL TVGYTTAFGA MFAKTWRVHA IFKNVKMKKK IIKDQKLLVI 

50 VGGMLLIDLC ILICWQAVDP LRRTVEKYSM EPDPAGRDIS IRPLLEHCEN THMTIWLGIV 
YAYKGIiLMLF GCFLAWETRN VSIPALNDSK YIGMSVYNVG IMCIIGAAVS FLTRDQPNVQ 
FCIVALVIIF CSTITLCLVF VPKLITLRTN PDAATQNRRF QFTQNQKKED SKTSTSVTSV 
NQASTSRLiEG LQSENHRLRM KITELDKDLE EVTMQWJDTP EKTTYIKQNH YQELNDILNL 
GNFTESTDGG KAILKNHLDQ NPQLQWNTTE PSRTCKDPIE DINSPEHIQR RLSLQLPILH 

55 HAYLPSIGGV DASCVSPCVS PTASPRHRHV PPSFRVMVSG L 
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Seq ID NO: 13 DNA sequence 

Nucleic Acid Accession #: NM_001565.1 

Coding sequence.- 67-363 

60 

1 11 21 31 41 51 

GAGACATTCC TCAATTGCTT AGACATATTC TGAGCCTACA GCAGAGGAAC CTCCAGTCTC 60 

AGCACCATGA ATCAAACTGC GATTCTGATT TGCTGCCTTA TCTTTCTGAC TCTAAGTGGC 120 

65 ATTCAAGGAG TACCTCTCTC TAGAACCGTA CGCTGTACCT GCATCAGCAT T AGTAA TCAA 180 

CCTGTTAATC CAAGGTCTTT AGAAAAACTT GAAATTATTC CTGCAAGCCA ATTTTGTCCA 240 

CGTGTTGAGA TCATTGCTAC AATGAAAAAG AAGGGTGAGA AGAGATGTCT GAATCCAGAA 300 

TCGAAGGCCA TCAAGAATTT ACTGAAAGCA GTTAGCAAGG AAATGTCTAA AAGATCTCCT 360 

TAAAACCAGA GGGGAGCAAA ATCGATGCAG TGCTTCCAAG GATGGACCAC ACAGAGGCTG 420 

70 CCTCTCCCAT CACTTCCCTA CATGGAGTAT ATGTCAAGCC ATAATTGTTC TTAGTTTGCA 480 

GTTACACTAA AAGGTGACCA ATGATGGTCA CCAAATCAGC TGCTACTACT CCTGTAGGAA 540 

GGTTAATGTT CATCATCCTA AGCTATTCAG TAATAACTCT ACCCTGGCAC TATAATGTAA 600 

GCTCTACTGA GGTGCTATGT TCTTAGTGGA TGTTCTGACC CTGCTTCAAA TATTTCCCTC 660 

ACCTTTCCCA TCTTCCAAGG GTACTAAGGA ATCTTTCTGC TTTGGGGTTT ATCAGAATTC 720 

75 TCAGAATCTC AAATAACTAA AAGGTATGCA ATCAAATCTG CTTTTTAAAG AATGCTCTTT 780 

ACTTCATGGA CTTCCACTGC CATCCTCCCA AGGGGCCCAA ATTCTTTCAG TGGCTACCTA 840 

CATACAATTC CAAACACATA CAGGAAGGTA GAAATATCTG AAAATGTATG TGTAAGTATT 900 

CTTATTTAAT GAAAGACTGT ACAAAGTATA AGTCTTAGAT GTATATATTT CCTATATTGT 960 
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TTTCAGTGTA CATGGAATAA CATGTAATTA AGTACTATGT ATCAATGAGT AACAGGAAAA 1020 
TTTTAAAAAT ACAGATAGAT ATATGCTCTG CATGTTACAT AAGATAAATG TGCTGAATGG 1080 
TTTTCAAATA AAAATGAGGT ACTCTCCTGG AAATATTAAG 

5 Seq ID NO: 14 Protein sequence; 
Protein Accession #: NPJJ01556.1 

1 11 21 31 41 51 

10 MNQTAILICC LIFLTLSGIQ GVPLSRTVRC TCISISNQPV NPRSLEKLEI IPASQFCPRV 60 
EIIATMKKKG EKRCLNPESK AIKNLLKAVS KEMSKRSP 



15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 



Seq ID NO: IS DNA sequence 

Nucleic Acid Accession Eos sequence 

Coding sequence: 241.. 1272 



11 



1 
I 

GCGCAGACGC 
CTCTACAAAG 
GTGCCCACCT 
TTGAGGGCCA 
ATGGTGACAG 
GAGCACTTCT 
CCTGCTCTGC 
GTGGTGGGCC 
ATCACACCAA 
GAGCGTGCGC 
TACAACAATG 
TCCGACGCCT 
AGCCTGGGCT 
GAGCAGGGCG 
CAGGATGTGT 
GCCACGCCCA 
GCCCGCGCTG 
GCCGGACTCC 
GGCGGCCTGC 
ACTGCCGCCC 
TGCGAGAGGA 
ATATTCTCCT 
ACCTGTTTTC 
TACTCGCTCG 
AGAGCCTCTG 
AGAGTTACAA 
TTGTTTTGCC 
TGTGTCACTC 
ACCTGTATCT 
CTAACATGAC 
TTTCCAGCAG 
GGGGAGAGAA 
CAAAGCTGGG 
GTTTC CCTT C 
CTTTCCTTTC 
GTCTTGTTCC 
CTTTGGGGAC 
ATTCCTCTTC 
ATTGATGGAA 
AAGGGTCAAA 
TGGATTTTCT 
AGGAAAGGGC 
GTGCCCGACT 
TTGAATCCTT 
TGGTTGAGGA 
GGAGGTGGGC 
TTGCTGCTTC 
TGGCCGGGCC 
GTGGGACTGT 
ATGGAGCAGG 
CACCTCTCCA 
CAGCCAAGAA 
GATGTCTGTA 
GGAGGGGGGT 
CCCACAAAGG 
AATTGAGCCA 
AGAAAGAGAG 
TAGCAAAAAA 
GTTTATTACC 
ATTAGGTAAT 



21 31 41 51 

ill) 
CCCAGCCCCC CACCGCCCCC AAAGGGGCGA GCGACGCCAA GCTCTGCGCT 
AGGCOGAGCT GCGCCTGAAG GGCAGCAGCA ACACCACGGA GTGTGTTCCC 
CCGAGCACGT GGCCGAGATC GTGGGCAGGC AAGGCTGCAA GATTAAGGCC 
AGACCAACAC CTACATCAAG ACACCGGTGA GGGGCGAGGA ACCAGTGTTC 
GGCGACGGGA GGACGTGGCC ACAGCCCGGC GGGAAATCAT CTCAGCAGCG 
CCATGATCCG TGCCTCCCGC AACAAGTCAG GCGCCGCCTT TGGTGTGGCT 
CCGGCCAGGT GACCATCCGT GTGCGGGTGC CCTACCGCGT GGTGGGGCTG 
CCAAAGGGGC AACCATCAAG CGCATCCAGC AGCAAACCAA CACATACATT 
GCCGTGACCG CGACCCCGTG TTCGAGATCA CGGGTGCCCC AGGCAACGTG 
GCGAGGAGAT CGAGACGCAC ATCGCGGTGC GCACTGGCAA GATCCTCGAG 
AAAACGACTT CCTGGCGGGG AGCCCCGACG CAGCAATCGA TAGCCGCTAC 
GGCGGGTGCA CCAGCCCGGC TGCAAGCCCC TCTCCACCTT CCGGCAGAAC 
GCATCGGCGA GTGCGGAGTG GACTCTGGCT TTGAGGCCCC ACGCCTGGGT 
GGGACTTTGG CTACGGCGGG TACCTCTTTC CGGGCTATGG CGTGGGCAAG 
ACTACGGCGT GGCCGAGACT AGCCCCCCGC TGTGGGCGGG CCAGGAGAAC 
CCTCCGTGCT CTTCTCCTCT GCCTCCTCCT CCTCCTCCTC TTCCGCCAAG 
GGCCCCCGGG CGCACACCGC TCCCCTGCCA CTTCCGCGGG ACCCGAGCTG 
CGAGGCGCCC CCCGGGAGAG CCGCTCCAGG GCTT CTCT AA ACTTGGTGGG 
GGAGCCCCGG CGGCGGGCGG GATTGCATGG TCTGCTTTGA GAGCGAAGTG 
TTGTGCCCTG CGOACACAAC CTGTTCTGCA TGGAGTGTGC AGTACGCATC 
CGGACCCAGA GTGTCCCGTC TGCCACATCA CAGCCACGCA AGCCATCCGA 
AAGCCCCGTG CCCCATGCCT CCGGGGCCCA CTCCACTGGG CCCACCCTGG 
CACTAAAGCC TTTTGGAAAG CGGTGATTTG AGGGGCAAGG TGCTTAGAGA 
CTGGGGAAGG GGGGAGGGAG GCAGTGGTGG CTGGAGGGTG CGCCACTTTC 
GTCACCCTGT CCTGGAAAGA TTGGGAGGGG GCCAGACTGA AAATTTTACT 
CTCTGATACC TCAACACACC CTTAAATCTG GAAGCAGCTA AGAGAAACTT 
AGAGGTGGCC ACTAAGGCAT TCTGACGCCC TCTGCCCACC TCCCCCGCTG 
CACCCCTTCT TCCGAGGAGG GGGTGGGTAA AAGGGAGAGG GAGAATTACC 
AGAGGTGCTC TTTGCAATCC CTAAGCCCTC TGGTCCTGAC CTCCGACCTC 
CCTTTACCTC CCACCCCACC CCCATATCCT GTTTGGGAAA CTGTCACCAG 
TGTAAGGGAG TTGGAGTCCT ATCAGAAGTT GCATAGATCT TCTAGGGGTT 
GCATGTCAAT CGTTTCTGTG GCTGAAAGGC TCAGAAGCCA TCTGTCCCCA 
CTAGAGGAAT CTGGAGAGGA GTCCTCCTCT CTGCCCCTGT CCCCTGCAGT 
ACTCTCTCCG CCTATCTTCC CTTCCTTTGG GATCTTCCCT TTCCTCAACT 
CCTCCAGCTC TTTGCTTTGC TTTCTTTTGG TGGCTGTCAC TCCCAGCTCT 
TTGTCTTTGT CTTTCTTCCC TTCCCCCTGC CCCTGCCCCT ACCAGCCCAG 
ACCATCCTTC TGGGGAGAAG TAGGGGGAGG AATATTTGGA TGGTCCCTCC 
AGGCATCTGG AGGCCCTCTC CCCCACTCCT CCAAAGAAAC ATCTCAAATT 
TGTATCCCCA TTCTCAGTGA AAATGTGAGG AGGGGACTAA TACTGGGGTA 
CCCCCACCTT CATCACTATG GGCATTATAT TTAGGGAGTA GTTCTTGGGC 
GGTTGTGGAA GTGGGGGCGC CAGAGTAGTG TGT CTGCTAT TTAAAGGAGC 
GTGAGGCAGG AGGAQAGACT GGTGGAGGGA AGAGCTGCTC CTCCCATGCA 
CCCTGCACCC CTCTCAACCT GACCTGAACC TTTATTGAAT CCTTATTAGC 
ATTAGCTTGA ATCCTCCATG CAAATCATGG AGTCTGTGTC CCACCTGATG 
GAAGCCAGGT CTTCAAAGAG GGGTCAGCCT GGGGCAAAGC AGGACTGGGG 
AGCAGGGCCT ATTCTGAGAA TCACATATTG TTACAGGCCT TGCACCCCCT 
CCTCCTGCTC ATTTGGGGCT GCCACCAGCT CTCCACCCTC CTGGTTCCGC 
AAGAGAGGAT GGAGGGATGG GAGTCCCAGG AGATCCTTGT AAATAGTGGG 
TCTGAGTGAT CACCCGAGCA CTTAAAGCTC CAGAGTCCCA TTCTTCCTGG 
TGGAGGTGCA GAGGGGATTT CCTCCTCTCC TTCCTCCTGT CGAGAATTAA 
CAGCCTTCCC CTCCAGAACA CCAGCCAGGG AGGGGTGGGG AAGGAGGTCA 
AACTGCCCTG TGACGACTTC CCTCCTTCCC GCCTATGTGA GCCATCCTGA 
CAATAGAAAC CAAACCAAAT GGGCACCCTC GGTTGCCGGG GGGCAGGTGG 
GGGAAGAAGG GATGTCTGTC TGTCGTCCCC CTCCCCCTCT CCACTCTTTA 
CAGAAGACTG TTACACTAGG GGGCTCAGCA AATTCAATCC CACCCTTACC 
AACCTAGAAA CAAACACAAA ACACGAATAG TGAGAGACAA AATAGAGGAG 
CATGAGAGGG AGCGAGACAG GCGACCAACA CAGAGGAGAG AAAACAAAAA 
AAAAAAAAAA AGCAGTTCTT TATAATTTAA TATTCTATTT TAATAAAGGC 
ATATAAATGT AGCAAAGAAC CTGGGCTAAT ATGAAAAAAA AAGACTTTTT 
TTATTATATG AAAAGGATAT TTTATTTTAT GATAAAGTGA TCCTTAAAAA 
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AATAAAAAAA CTTTAGAAGG TTTAGAATAT ATGTAGGGAG AGAAGAAGAA AAAAATACAT 3660 

TTGTATTCAG AGTTAAATCT TAAAAAAAAA AAGTGTTTTT AATATATGTT TGGGTTTACG 3720 

TTGCTTTTTT CCCCCACTTT TTTTTTGGGG AGGAATGTCA TTTGCTTTTC TTGGGGGAGC 3780 

ATCCCGGGGG TGAATGGTGG AGAGAGGAGC TGGGGGAACC CGGTCCCTCC TGGGACCCTT 3840 

CCAGTAGATT GGATTTCACT CCATGGACTC CTCCTCCCCT CTCCCCCTCC CCCTCAGGGG 3900 

AGCCGGCAGA GCCAAACAAA GAAAGGGATT AACAAGAAAG GAAGAAGCTG TAGGACTAAG 3960 

GACTGAGGAT CCTGGGGTGT CCCCCACCAC TTTCCCCTGC CCTGTCGCAG GGGCAAGTGA 4020 

GGAGGGGGAA TCCAGAATTA AGGCCTAGCA GGCCTATAGG AACCCTCAGA GATGTGTGAG 4080 

ATTTAAGAGA TCTAGATTTT TTTTTAACCA AAAACAAGAG AGAAAGAGAA GAAAAAGAGA 4140 

AACCGAGGGG TTTAAAAGAA AAGAATACTA CAAAATAATA ATTATTAATA ATAATAATTC 4200 

AAATTTATTT CATATAATCC TAGAGAGAGA AAGAAACAAT TACTAGTTAC TTAGTAGACA 4260 

ATATTAAGAT AGCTTAAAGT TTAGTAGCAT TGAGGGCCCC TGGGTCCAGT AGAATGTATA 4320 

AAAGTTGTAA GGAAAAGATA AATAGAGGAG GGAAGTGGCT GAGTCCACCC TGAGTTGCCC 4380 

AATCTTCAGA TACCAGGGTT GGATCAGGTT GCTAGTTTAA GATTGGGAGC TTCCAGTCTG 4440 

CTGGGGTTGA TTCTGAGAAT CCTTGGATTT TTAAATTGTA GGACAAAGAA ATGAGGGGTT 4500 

CATTTCCCAG GGTCTTGGAA AGGATGCACA CTGATCATCT CAATAAGACA GGGGCTGGGT 456 0 

TGGGGGCAGC AGAGGAGGCC AAGCACATTC ACCTGCACCC CTAGTACCTG GGCAGCCCAT 4620 

ACTCCAATGT GGTATGTCCC CTCCTGGGGC TCCCAGCTCA AACCCTCCCA TGCCTGCTTC 4680 

CCCCAGGCCT AACTGAGGAA GTCCTTCTTG AAGTGTGACC TCGGTCCACT TCTCTACAGA 4740 

TTGATTTAAG AGCCTGGGAA GTCATTCCAC AAACAGACAC ACATGCACAC ACGCTTCTCA 4800 

CCTTCAGAGC TTCAAGAGCA CTGAGGCGAT CAGTCCCCTA CCCCTGTTCC CATCCA GCTT 4860 

TCCACTTAGC TTTGACCTCC ATGGCAGCAG TAGCAGTAAC AATCTCAGTA ATTGTTCTTT 4920 

AAAGCTGACT CGTTCTTCAC CTACTTGCAA AGTGCTTTCT TGTCTCATAA AAGTTAGATT 4980 

CCAAGAAGGA CTTCCCACGG AGTGGAGTGG AAACACTGTC CTTGAAGGCC TGGGAGAAAG 5040 

GCATCCCCAT GGGCACAGAG GCTGGGGAAA GGCACAGGGA CTTTGGGTGA CCCTAACCCT 5100 

GACCCTCTGC TCCAGTTCAC CTCCATCTAT ATGTGTTCAG GTAGGGGTCA TCTACTGTAC 5160 

CCTGGCCTGG GAACACATTG CCCTCCCCAC ACAAAACTGG AGGGCTTGGC TTCTGCGTGT 5220 

GAGAAATCAA CATTTTTAAA GCACTTGCCT TCTACCAACC CCAGCTTGCA ATCACTGGGC 5280 

CTTCCCCTCC TATCCAAGGG GTTGGAGGGG CCCCTTGGCT CTCCTTTTGG CAGGAGGAGC 5340 

CTGCTTCATT ACACCAATGA CTCTGCCATC CCCCTCCCTG GCCCTAGACC CCAAACACAT 5400 

CTCCCTCTAC CCAATTTACT CTTCTCGCCC CACCTAGGGA CAGATTCCCC CTGCTCTTTT 5460 

TGTCCTAGAA ACCCCGCTAG TTTGGGATGG TAGCGTCTGG GGTGGGGAGG GCTTCCCCTT 5520 

CCCCACTCGA GGGTGCGGGT GGGGAAGGGG GGGTGGGTGG AGACAGCCCT GGGGCAGGGA 5580 

GGATGGTCTC TCCACTGTAG AAAGTAGAGT AGGATTGTGG TCAGACTTAA TTTGAGGCAT 5640 

CTAGTGAAGA CACGTACAAA TCCACCAAGG AAAAAGATTT CAAAAGCAAA ATAAAAGOGG 5700 

GAAATAAAAC AGACCCAAGA ATAATCAAGT CAAAGTGATG TTGCACAAAA TGCAGAGAAA 5760 

CCAAGAAGGG GGAGGGTTAA TGTATTAAAT GTGCTATTAA GAACTTAATT TTATTAAAAG 5820 
TACTATTACT TAAGGCTC 

Seq ID NO: 16 Protein Sequence 
Protein Accession XP_044166 



1 11 21 31 41 51 

I ) | ] t I 

MVTGRREDVA TARREI ISAA EHFSMIRASR NKSGAAFGVA PALPGQVTIR VRVPYRWGL 60 

WGPKGATIK RIQQQTNTYI ITPSRDRDPV FEITGAPGNV ERAREEIETH IAVRTGKILE 120 

YNN END FLAG SPDAAIDSRY SDAWRVRQPG CKPLSTFRQN SLGCIGECGV DSGFEAPRLG 180 

EQGGDFGYGG YLFPGYGVGK QDVYYGVAET SPPLWAGQEN ATPTSVLFSS ASSSSSSSAK 240 

ARAGPPGAHR SPATSAGPEL AGLPRRPPGE PLQGPSKLGG GGLRSPGGGR DCMVCFESBV 300 

TAALVPCGHN LFCMECAVRI CERTDPECPV CHITATQAIR IPS 343 

Seq ID NO: 17 DNA sequence 
Nucleic Acid Accession #: NM_005940 
Coding sequence: 23-1489 

1 11 21 31 41 51 

AAGCCCAGCA GCCCCGGGGC GGATGGCTCC GGCGGCCTGG CTCCGCAGCG CGGCCGCGCG 60 

CGCCCTCCTG CCCCCGATGC TGCTGCTGCT GCTCCAGCCG CCGCCGCTGC TGGCCCGGGC 120 

TCTGCCGCCG GACGTCCACC ACCTCCATGC CGAGAGGAGG GGGCCACAGC CCTGGCATGC 180 

AGCCCTGCCC AGTAGCCCGG CACCTGCCCC TGCCACGCAG GAAGCCCCCC GGCCTGCCAG 240 

CAGCCTCAGG CCTCCCCGCT GTGGCGTGCC CGACCCATCT GATGGGCTGA GTGCCCGCAA 300 

CCGACAGAAG AGGTTCGTGC TTTCTGGCGG GCGCTGGGAG AAGACGGACC TCACCTACAG 360 

GATCCTTCGG TTCCCATGGC AGTTGGTGCA GGAGCAGGTG CGGCAGACGA TGGCAGAGGC 420 

CCTAAAGGTA TGGAGCGATG TGACGCCACT CACCTTTACT GAGGTGCACG AGGGCCGTGC 480 

TGACATCATG ATCGACTTCG CCAGGTACTG GCATGGGGAC GACCTGCCGT TTGATGGGCC 540 

TGGGGGCATC CTGGCCCATG CCTTCTTCCC CAAGACTCAC CGAGAAGG GG ATGTCCACTT 600 

CGACTATGAT GAGACCTGGA CTATCGGGGA TGACCAGGGC ACAGACCTGC TGCAGGTGGC 660 

AGCCCATGAA TTTGGCCACG TGCTGGGGCT GCAGCACACA ACAGCAGCCA AGGCCCTGAT 720 

GTCCGCCTTC TACACCTTTC GCTACCCACT GAGTCTCAGC CCAGATGACT GCAGGGGCGT 780 
TCAACACCTA TATGGCCAGC CCTGGCCCAC TGTCACCTCC AGGACCCCAG CCCTGGGCCC 840 
CCAGGCTGGG ATAGACACCA ATGAGATTGC ACCGCTGGAG CCAGACGCCC CGCCAGATGC 900 
CTGTGAGGCC TCCTTTGACG CGGTCTCCAC CATCCGAGGC GAGCTCTTTT TCTTCAAAGC 960 

GGGCTTTGTG TGGCGCCTCC GTGGGGGCCA GCTGCAGCCC GGCTACCCAG CATTGGCCTC 1020 

TCGCCACTGG CAGGGACTGC CCAGCCCTGT GGACGCTGCC TTCGAGGATG CCCAGGGCCA 1080 

CATTTGGTTC TTCCAAGGTG CTCAGTACTG GGTGTACGAC GGTGAAAAGC CAGTCCTGGG 1140 
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CCCCGCACCC 
GGGTCCCGAG 
CAGCACCCGG 
CTCTGAGATC 
CCTCTACTGG 
GGGTCCTGAC 
ATGCCCTCAG 
ATCTTTGTGG 
GGTGGGGTAC 
AGCGACTGTC 
GGGACCCGCT 
GTAGCACCAT 
TCCTTCCAGG 
TGAGCAACTG 
ATCTGTCTGC 
GTTCACAGTC 
CAACATACCT 
ATCCTCCAAA 
TTTTTAAACT 



CTCACCGAGC 
AAGAACAAGA 
CGTGTAGACA 
GACGCTGCCT 
AAGTTTGACC 
TTCTTTGGCT 
GGGTGCTGAC 
CTGTGGGCAC 
AACCACCATG 
TCAGACTGGG 
ATGCAGGTCC 
GGCAGGACTG 
GGCTGGCACT 
GGCTGTAGGG 
CTTCTGGCTG 
AAATGGGGAG 
CAATCCTGTC 
GCCATTGTAA 
GAGGATTGTC 



TGGGCCTGGT 
TCTACTTCTT 
GTCCCGTGCC 
TCCAGGATGC 
CTGTGAAGGT 
GTGCCGAGCC 
CCCTGCCAGG 
CAGGCATGGG 
ACAACTGCCG 
CAGGGAGGCT 
TGGCAAACCT 
GGGGAACTGG 
GAAGCAAGGG 
CAGGGCCACT 
ACAATCCTGG 
GGGTATTCTT 
CCAGGCCGGA 
ATGTGTGTAC 
ATTAAACACA 



GAGGTTCCCG 
CCGAGGCAGG 
CCGCAGGGCC 
TGATGGCTAT 
GAAGGCTCTG 
TGCCAACACT 
CCACGAATAT 
ACTGAGCCCA 
GGAGGGCCAC 
TTGGCATGAC 
GGCTGCCCTG 
AGTGTCCTTG 
TGCTGGGGCC 
TCCTGAGGTC 
AAATCTGTTC 
CATGCAGGAG 
TCCTCCTGAA 
AGTGTGTATA 
GTTGTTTTCT 



GTCCATGCTG 
GACTACTGGC 
ACTGACTGGA 
GCCTACTTCC 
GAAGGCTTCC 
TTCCTCTGAC 
CAGGCTAGAG 
TGTCTCCTGC 
GCAGGTCGTG 
TTAAGAGGAA 
TCTCATCCCT 
CTGTATCCCT 
CCATGGCCTT 
AGGTCTTGGT 
TCCAGAATCC 
ACCCCAGGCC 
GCCCTTTTCG 
AACCTTCTTC 



CCTTGGTCTG 
GTTTCCACCC 
GAGGGGTGCC 
TGCGCGGCCG 
CCCGTCTCGT 
CATGGCTTGG 
ACCCATGGCC 
AGGGGGATGG 
GTCACCTGCC 
GGGCAGTCTT 
GTCCCTCAGG 
GTTGTGAGGT 
CAGCCCTGGC 
AGGTGCCTGC 
AGGCCAAAAA 
CTGGAGGCTG 
CAGCACTGCT 
TTCTTTTTTT 



1200 
1260 
1320 
1360 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 



Seq ID NO: 18 protein sequence: 
Protein Accession »: NP_005931.1 



MAPAAWLRSA 
PAPATQEAPR 
LVQEQVRQTM 
FFPKTHREGD 
YPLSLSPDDC 
VSTIRGELFP 
QYWVYDGEKP 
PVPRRATDWR 
AEPANTFL 



11 
I 

AARALLPPML 
PASSLRPPRC 
AEALKVWSDV 
VHFDYDETWT 
RGVQHLYGQP 
FKAGFVWRLR 
VLGPAPLTEL 
GVPSEIDAAF 



21 
I 

LLIiLQPPPLL 
GVPDPSDGLS 
TPLTFTEVHE 
IGDDQGTDLL 
WPTVTSRTPA 
GGQLQPGYPA 
GLVRFPVHAA 
QDADGYAYFL 



31 
I 

ARALPPDVHH 
ARNRQKRPVL 
GRADIMIDFA 
QVAAHEFGHV 
LGPQAGIDTN 
LASRHWQGLP 
LWJGPEKNKI 
RGRLYWKFDP 



41 
I 

LHAERRGPQP 
SGGRWEKTDL 
RYWHGDDLPF 
LGLQHTTAAK 
EIAPLEPDAP 
SPVDAAFEDA 
YFFRGRDYYIR 
VKVKALEGFP 



51 
I 

WHAALPSSPA 
TYRILRFPWQ 
DGPGGILAHA 
ALMSAFYTFR 
PDACEASFDA 
QGHIWFFQGA 
FHPSTRRVDS 
RLVGPDFFGC 



60 
120 
180 
240 
300 
360 
420 
480 



Seq ID NO: 19 DNA sequence 

Nucleic Acid Accession &: NM_005794.1 

Coding sequence: 434-1276 



11 
I 

CACGCTGTGA 
GGGTCCGTGC 
GGGGCCAGCA 
AAGAACCGTA 
GAACCCATCA 
TGGTGAGACT 
TGAGACTATC 
ACTATGCTGT 
TCTGTGAGGA 
GTGGTCACGG 
GGGGCCCACG 
CTGCAGGGGG 
CGGGAGCAGC 
AGCGCAGGGG 
AAGATCCTAA 
ATGGAGAACA 
GTGGCGCTGG 
GCATTGGAGC 
ACTGACTTCA 
CATCAGCTGC 
TCTCCAGATG 
CTCTGAGAGG 
GTCTAGGTGA 
GGGCTTACTC 



GGTTCCCTTC 
CTCACTCGTT 
CTTCACTCCT 
CACCATCTTT 
GTGAGGCCAA 
TATCACCAAG 
CACCAAGCGG 
GACACCAACC 
TGCTAGGCTT 
CCGGGTAGCC 
GGCCCGGGAC 
CATGGCCAAG 
GGCTGAGGAC 
CCTGGTGTGC 
GATCTGGGAC 
GCTGCCCTAC 
TAATCCAGTA 
TAGAACACTG 
AATTATAAAA 
CAAGGAACAT 
CTTCCTGTGC 
CTCCACTCGG 
CTGAGGGGGT 
AGCAATTTGG 
CC 



Seq ID NO: 20 Protein sequence: 
Protein Accession 8: NP.005785.1 



21 
I 

AGCTTTGTTC 
CACCTTTAAG 
AGACCACGAA 
ATACTCACCG 
ATTCCGTACA 
ATTGCCAAGC 
ACCTATCGCC 
CAGCAGTTGC 
TGAGCAGCAC 
GGTCCACCAG 
TGGTCATCAG 
AGGGGCTGAG 
TGGTGGCCAA 
TCAACCCTCT 
GTGTGAACGT 
GGAGGGGTGC 
GTGTCTACAA 
TGGCCCCCAA 
GCAAAGTGTT 
AGAGGATTGG 
CCAGCTACGT 
AGTGGGGGCG 
TCATTTGGAT 
ATGCTAGGCT 



31 
I 

TTTTGGTCTT 
AGCTGTAACA 
TGCACCGAGA 
CAAGGGTCTG 
CATTTTGGTG 
AGTGAGACTA 
AAGTGGCCTG 
CCGGGGCTAC 
CGGGATAGAC 
TGGGATCGGC 
CAGCCGGAAG 
TGTGGCGGGC 
GGCCCTGGAG 
GGTAGGGAGC 
GAAGTCCCCA 
TGTCATCCTG 
TGTCAGCAAG 
GGACATCCGG 
TCATGGGAAT 
GGAGTCAGAG 
CAACGGGGAG 
GCTGCGTAGC 
CTGGAGCAGA 
TGAGGAAGAA 



41 
I 

CATGATAAAT 
CTCACCGCGA 
GGAATGAACA 
CAACTTCATT 
ACTTTGAAGA 
TTGCCAAGTG 
ATTCAGCAGG 
CAGGGCTGGT 
AGGAAGGGCG 
TTTGCCATCG 
CAGCAGAACG 
ATTGTGTGCC 
CACTGTGGGG 
ACTCTGGGGA 
GCCCTGCTGC 
GTCTCTTCCA 
ACAGCGCTGC 
GTAAACTGCG 
GAGTCTCTCT 
GACTGTGCAG 
AACATTGCGG 
TGTGGTCCCA 
GTCTGCCATT 
GAAAAACGCT 



51 
I 

CTTGCTGCTG 
AGGTCTGCAA 
ACTCTGGACA 
CTTGAAGTCA 
GACTGTCACC 
GTGAGACCAT 
AAGCATCTCA 
TTCATCCCTG 
TCCTGGCTAA 
CCCGACGTCT 
TGGACCGGGC 
ACGTGGGGAA 
GCGTCGACTT 
CCAGTGAGCA 
TGAGCCAGTT 
TTGCAGCTTA 
TGGGTCTCAC 
TGGTECCAGG 
GGAAGAACTT 
GAATCGTGTC 
TGGCAGGCTA 
GCCCAGGAGC 
CTGCCAGACT 
TCGGCATTCT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
13B0 
1440 



51 



1 11 21 31 41 ( 

MLSAVARGYQ GWFHPCARLS VRMSSTGIDR KGVLANRVAV VTGSTSGIGF AIARRLARDG 
AHWISSRKQ QNVDRAMAKL QGEGLSVAGI VCHVGKAEDR EQLVAKALEH CGGVDFLVCS 
AGVNPLVGST LGTSEQIWDK ILSVNVKSPA LLLSQLLPYM ENRRGAVILV SSIAAYNPW 
ALGVYNVSKT ALLGLTRTLA LELAPKDIRV NCWPGIIKT DFSKVFHGNE SLWKNFKEHH 
QLQRIGESED CAGIVSFLCS PDASYVNGEN IAVAGYSTRL 



60 
120 
180 
240 
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Seq ID NO: 21 DMA sequence 

Nucleic Acid Accession 8: EOS sequence 

Coding sequence: 77-904 
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21 



31 



41 



51 



1 11 

I I I I I I 

GCGGCGTTTT GCCTCCGCAG CAGCTCTGGG CTCTTCTCAG CTGCGCGAGC AGCTGCTCCA 
ATGCCCCGGA GTGGCCATGG GCGCCCCGCA CTGGTGGGAC CAGCTGCAGG CTGGTAGCTC 
GGAGGTGGAC TGGTGCGAGG ACAACTACAC CATCGTGCCT GCTATCGCCG AGTTCTACAA 
CACGATCAGC AATGTCTTAT TTTTCATTTT ACCGCCCATC TGCATGTGCT TGTTTCGTCA 
GTATGCAACA TGCTTCAACA GTGGCATCTA CTTAATCTGG ACTCTTTTGG TTGTAGTGGG 
AATTGGATCC GTCTACTTCC ATGCAACCCT TAGTTTCTTG GGTCAGATGC TTGATGAACT 
TGCAGTCCTT TGGGTTCTGA TGTGTGCTTT GGCCATGTGG TTCCCCAGAA GGTATCTACC 
AAAGATCTTT CGGAATGACC GGGGTAGGTT CAAGGTGGTG GTCAGTGTCC TGTCTGCGGT 
TACGACGTGC CTGGCATTTG TCAAGCCTGC CATCAACAAC ATCTCTCTGA TGACCCTGGG 
AGTTCCTTGC ACTGCACTGC TCATCGCAGA GCTAAAGAGG TGTGACAACA TGCGTGTGTT 
TAAGCTGGGC CTCTTCTCGG GCCTCTGGTG GACCCTGGCC CTGTTCTGCT GGATCAGTGA 
CCGAGCTTTC TGCGAGCTGC TGTCATCCTT CAACTTCCCC TACCTGCACT GCATGTGGCA 
CATCCTCATC TGCCTTGCTG CCTACCTGGG CTGTGTATGC TTTGCCTACT TTGATGCTGC 
CTCAGAGATT CCTGAGCAAG GCCCTGTCAT CAAGTTCTGG CCCAATGAGA AATGGGCCTT 
CATTGGTGTC CCCTATGTGT CCCTCCTGTG TGCCAACAAG AAATCATCAG TCAAGATCAC 
GTGATGGCAA GATGGTGGCT GGCTTCTCTG CTTATCGCCC CTCATGCAGT GGGCTTCCTT 
TGCTAGGAAG ACAGCCAAGG GAGTTCGAAT AGTTGGGGTG TGGGCTATCT TTTCAAAAAT 
CTATTTGCTG GGGCTCTTAA TTTCTTTAGT GTTCTTTGTA TGTAG GGATT TAAACTTTGT 
CATATGGTAC AAATATTCCC TGCCCCCCTG CAGTTTCCCA TTTGTCTTTC AGTATGTTAA 
TATTTTTGTG CCATACTGGT TTTAAACTTT CATGTTGTCA CATCTGTTAA TCTTTTCTTT 
AGGATTTCTG GATTTTGTGT AATTTTTAAA AAGGTCCCCT CCTC CTCC CT AATGTGTCTG 
TGGACCACCT GGATTCCACT GTACAAGGGG AAAAGTGTCT ATTCCTTTCC CAAAGATGGA 
AAATGGAGGG CTTAGGGACA CTAGATGCAT CTTTCTCAGC ATCACTTCCA GATGCAGTGA 
CTTGTTCGGC TGCGTCCTTA ATGGCCATGG CAGAGCAGTC CCTTGGGGGA TCCAGCCCTG 
TACAATGCAT CTCTTCCTGG AGAAAGCTGG CCTGCTCCAG ACCCCACCAT TCCCAGGCGC 
CCTTGGAGTG GACTCTACTG ATGACAGACA GACCCTCTGA GAGACAAGAC CCTCTGACTC 
TGTGATGGAA GATGCCAGAG ATTTTCCTTT GGGGTAATTG TCCTTAAACA AAACCAAACA 
GATGAAACAC ACACAGGACT TGTGGCTAAA AAGGCTAGTT TTTCACTTGC ATTTCTCAAC 
TAACCCAGGT TTTACATGCA TCTGTGAATC CTTTTACTAC TACCTCTGTG GAGAGATGGA 
GAGACTTCAG ATAAACGTGA AGCTAATGAG TAAAACCCTC TCTGCCAAAA CCTACACTCC 
ACTTTAGGCC CTTCTTGAAG ATGAGCACAA TTTTTAAATA CTGAGCACAA TTTTTAAATA 
CTGACATCAC TTCCTCTTCC CCCTCCCACC CCAGCTCAGC AGCCTCAAAT CTACAGAGAA 
GAAGAATTAT GGCATGAACA TTCCCACAGA CCCACCATCT TTAAGACTTG ACCTCTGTAA 
GTTTACCAAA GGGCTCCTCA CAATTGTGGT GGGGGTTCTG GTTCAAAATT TGGAGCAAAC 
ATGAAGTTTT TGGAAACGTT TTCTCATTTG AAGCCTCCAG TATGCTGTAC TATTCTGGAA 
ATTACCTTCA AGAGTCTCAC TTCTTGTTTC TGTTGTGTTT TCTGTGGGCA TCATGTTCTT 
CACGCTTGCA GTAGAAGGTG CTTTCTCGGT TTCCCAGAGT ATCCAACGGC TCACCTTTCT 
CAAGTGCTGG CAGTAGCTAT GCACTCACGG GCTGGTTTGG GTCGCTGGTG CAGCAGCGCA 
AATCTGTTGC CTTCTGAATT TTTCTCACCT AATGTGACAC TGGCTACAAT GAATCTTCTC 
TTCATCGGGC TGAATGAAAG ATTCAAGAAC CATCTTCAAG GTGCATGGTG GGAATTATCA 
ACCTCAGGGA TACTCATTTT AACTCAGGCG TGTCCTGCTT TGTAACATTC CATTGTTGGG 
AGAGGGCAGG ACAGGTGTGT TCTTCTGTGG GCAGGAGTCA TGTCACTGTC CTACATATGT 
AAGAGTTGGG AAGGTGACGA TTTTTGACAC ATCCAGGAAC TCTTACTCTA GTTAGAATTT 
GTACCAGATC CAAGGTGAAA ACCCCAATAA GCAACTGAAT TTAGAGTTTA AAAATGAATG 
ACTTTATGCT ACATCTGTGG TTATCAAATT ATATAGGTTG TTGAGAAGCA GAACGCTGTT 
TGTAGTAAGA AATCTTTGTG GAACCCCAGT GTGTGAAGTA AATTGTATGT TATTAAATTT 
ATTTAAGGTT AAATTTATGG CATTTACTTA ATAATATATG AGGTCGTGAA AATCCAAATT 
AACAAATTGG TAATTTCCAA GGTAGAAAAA TTAGGTGTTG AATGAATGTA TATGTTGGTT 
TATTATAGTT TTATATATAT ATAGAGAGAG TGTTTTGTTT TTGAGTCAGG GTCTTGCTCT 
ATTACCCAGG CTGGAGTGCA GTGGTGCAGT CATGACTCAC TGTAGCCTCT GTCTCCCAGG 

c^Sag5gat Stctcacct cagcctcccc aggagctggg agtactggtg cotgcctcca 

rrCCCAGCTA ATTTTTGTAT TTTTTCATAG AGATGGGGTT TCACCATTTT GCCCAGGCTG 
GTCTCAAACT TCTGGGCTCA AGAGATTCGC CCGCCTCGGC CTCC^AAAGT GCTGGGAATA 
GGCATGAGCC GTCACGCCTG GCCTAAAAAA TATTTTTAAA ATGATCTTTG AATTAAATAT 
^AG^S TCTAATGTAT CTCTTTGAGA CCTAGGAGGT TGATGGAACA G^CTGCTGT 
TAAGTCCTTT GGGTTTCAAG TCTAGAATTT TTTAAAGGCA AATATCAGCT CATTCTTATT 
TTAGATTGAC CTTATCAGGC ATGGATTCTG gtctcatcta crrrATCGTA taaatgctcc 
AAGGTAGGGG GTTTGGTATA TATTTTAAGC CCGGCTTTTT A 1 1 1 1 n i rT TTmrTTTT 
TTTTTAATGT GAGAAGCAGA ATCTGCTTCT AGAAACTGGT mAAAGAGA ^CTGAGA 
AAGAAATGTG GAATGGAGTA TATTTGAGGA GGACAAAACA TAACTTCACT TTTGAACAGA 
AATCACTCTA GCTTGCCAGC ATGGGATGTA AACCAAGAGA GTAGAAATAT ACC^ATCTTA 
WTAAGTTG GGTTTATGGC ATCGCTCATA TATGTAAAAG CACTACAAAC TOTTAAAGA 
AAATTGGGAA ACTACAGAGA AGTCAAAGAA AAAAAAAAGT AACCCATATT TCTATTGCCC 
aggtataatc CTTGTTAATA TTTTGGTTTG gtctcctctt TTTTTCCCCC AATATAGTTG 

taaataaatg atgtctttca gagttgacat ttatcctgta gcttgaatgg catgtaaatg 
ccagttgtat attttttcat gaagtgtagg tttggaatac actagagtta GCTATATGCT 

TGAATGCTGA TCACTGGATT CTGAGACTGA CTACTGAGTC TACCTTTTTA ATCAAGCCTA 

I^atgaatgg gctccaaaaa gtaatgaatg taattgtact ttttgatgtg cctctgcact 
^cSggS agtcatcata aatagctgtt aaatatgtga ctttacagat tttgatatgt 
tcagattgta aaaaatgaat agtttatttc attaattgat gggcagtcaa gaatctccct 
cc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
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13B0 
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1740 
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2040 
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2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
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Seq ID NO: 22 Protein sequence: 
Protein Accession 8 : EOS sequence 

1 11 21 31 41 SI 

I I I I I I 

MGAPHWWDQL QAGSSEVDWC EDNYTIVPAI AEFYNTISNV LFFILPPICM CLFRQYATCF 
NSGIYLIWTL LVWGIGSVY FHATLSPLGQ MLDELAVLWV LMCALAMWFP RRYLPKIFRN 
DRGRFKVWS VLSAVTTCLA FVKPAINNIS LMTLGVPCTA LLIAELKRCD NMRVPKLGLF 
SGLWWTLALP CWISDRAFCE LLSSFNFPYL HCMWHILICL AAYLGCVCFA YFDAASEIPE 
QGPVIKFWPN EKWAFIGVPY VSLLCANKKS SVKIT 

Seq ID NO: 23 DNA sequence 

Nucleic Acid Accession #: CAT cluster 



60 
120 
180 
240 



1 
I 

TGAAGTATCT 
CCAGATACGC 
CAATCCTTAT 
CTCATCACAT 
GTGCTATCTA 
ACGGTGGTCA 
TTTGAATGAA 
GAAAAACATC 
ACTCAATACA 
GCGTCTACCA 
TAAGCAATGA 
CTGAACCTCA 
TGGTCTGAAA 
CACAGAGTCA 
CATCAAAGCA 
TTTCCATTTA 



11 
I 

CTGTTTAATG 
AGGAGGACAC 
GACTCTGTTT 
ACCTGATCCC 
AGTTGGAGGT 
AAGTAAATAG 
ATTGGTTTGT 
CTGACTCTTA 

CGTTGCAAAG 
AATTTCAAGC 
AGATTCAAAT 
GTACCATTTC 
ATCAGATTCA 
TTTCATGCCC 
AAGACTTTCC 



21 
I 

TCATATATCT 
ATTCTTGATC 
AAAATGAGTA 
ATTTAACTTC 
TCTGCTTGAG 
CTAACATATT 
CATTGACCAC 
TGTATGCTTG 
CCAACATTTA 
ATTCAACATT 
AACAATATTA 
GGATTTTCAT 
TGCATTTATT 
AAATGTTAAA 
GTCAGCAACT 
TTGGCTGACT 



31 
I 

GAGATGTTCT 
GGTATTTTAT 
TCTGCATTAT 
AGTCTATCCC 
TATTCTTAAT 
GGCATCAAAC 
TGTAAGAAAC 
CCTGTCTCTG 
TCTCCTATTT 
TCTCTTAAAA 
CTTCTCTCAT 
TGAAATTCAC 
TTAGCCCATG 
ATCCTTCCTG 
TTTTAATGCA 
TCTCTGATGA 



41 
[ 

GTGGAGCCAG 
TCTATTATTT 
TGTGCAAACA 
TCCATTACTT 
TTTTCTTCTT 
CATTGOCTAT 
TCATTTGTAA 
AACTCCACCT 
CAGGAGAAAG 
ATAGGAGTTT 
GTCCTTTGCT 
CACGATCCCC 
GAATAACTGT 
CTTGGAGTTT 
TTTGCTCATG 
GGTTTCCTGC 



51 
1 

AATTCAAAAC 
TTATTAGTTC 
CTTCTGAGAT 
TTTGTGGAAA 
TTTAATGCAT 
TGAAGCAAGC 
AGATAATACT 
CTATCAGCAT 
TTTTGTGGGA 
CTTTTAGATA 
TTTTTAAGAT 
TCTGCAGCTG 
GCTGAGAAAC 
TCCGTCTTCA 
GTTTGCAGAA 
TTGCCTC 



Seq ID NO: 24 DNA sequence 

Nucleic Acid Accession 8« Eos sequence 



I 



GTGTGTAGTC 
AGTAGGCTCA 
GAAAGTGGTT 
ACTACTTGGC 
ACCCTCTGTG 
CTGTTCCTGG 
AAAGGAACAT 
OCAACAGTCC 
CCTGGTGCAA 
GTCCTCCGGA 
GCAAGCAGGA 
AAACGAGGAG 
CGGAAAACTC 
CAGCACAGTT 
CAGAGGGGAT 
AAAAAGCAAA 
AAAGAAACTC 
AAAACTTTCT 
ATAGAGGTGG 
ATCTTTACAA 
TTCAATAGGC 
TAAAAAGAAG 
CAAAAAGTAA 
GAAGTGTTTG 
ATAAAAATAA 
AATTCTGGCT 



11 
I 

GGGTATGAGA 
GTAGCACGAA 
TAGCATGCTG 
CTCTGGGACT 
AGGAGGATGC 
TAAGCTCCTG 
CTACACACAG 
CCATTCTACT 
AGAACAGTCG 
AAAGGGACGT 
AACCTCATCA 
CAAATGCATT 
CAAGCAGGAA 
ATTCCATGGG 
CGTGGTGAAT 
GGACATGAGA 
CTATTTTTAA 
CCTGAAATAG 
AGTTCAGAGA 
ATGAGTTTCT 
AATGGTTTGA 
AAAAATTAAG 
TGGAGGGATA 
CACAATAATG 
TAGAATAAAA 
CCACAGAACT 



21 
I 

ACGATTGCAA 
CTGCAAATAC 
GAGCTGGTTT 
CAGATTCCTC 
TGTGAGGACA 
GAAAGAAGTC 
AGGAAATGAA 
TGCAGAAAGG 
TTTCTCCAAA 
GAAGCCCCCT 
GAGAAGTCAG 
AAAAAGTTGC 
GGATTTTAAC 
CTAAAATAAA 
TTCAATGAAA 
GAAGTAATAT 
GAGAAATGTT 
GAGATAAATG 
CAGGCAAGCA 
TACAGTGGTC 
TGCCAATATG 
AATACTCAAG 
GACTGAAGTT 
CAGATACTCA 
TACCGATCAA 
TGTCAGATAT 



31 
I 

GCAAAAGCAG 
TTGAGAAGGG 
TCTGCTTGGT 
CTCTTTAAAA 
AAATGAGATC 
TATGGATTAT 
TAACATGAAG 
TTGCTTACAC 
ACTGAAGCTG 
TTGTAATTTC 
CCAAGGAAAG 
TGACGGGCAT 
ATTTTGAATC 
TGCAGAAATG 
ATCCATTTGA 
TGTTGCTTGA 
GAATCTTTGC 
TTGGAAAGAG 
TACATAAGAG 
AATGACAAAC 
TTAGCTATTT 
CAGAACCTCC 
AAATGGGATC 
TTTTAAACAG 
GAATGTGTCC 
ATGACATTAA 



41 
I 

CTGAGGGACG 
AAAAGTTCAG 
AGCCCTACAA 
TGGTGCTAAT 
ATCCACATAA 
ACTCAACCTA 
TGAAGTCTTC 
TGAAAATCAG 
GAAATTATCT 
TGCATTAGCG 
TCTTTAAATG 
GAAATGCTTT 
TGATTGACTC 
GTACTTTCAG 
ATCTTGAGGT 
AATTTCATTG 
AAOGTGGTAG 
GCAATGTATT 
TCAGGATGTT 
CAATTTCATT 
ACTTTGACCA 
AACTTAGATA 
AGGTATGTGA 
AGTCATAAGG 
TCCTGOGTAT 
AC 



60 

120 

ISO 

240 

300 

360 

420 

4 80 

540 

600 

660 

720 

780 

840 

900 



51 
1 

CTGTCCAAGG 

CCTTGTGCTG 

CTTTGGCCCA 

AATAGCACCC 

GCCGTGAACC 

CACTCCAGTT 

ATCTCCATTC 

TTTATTTTCC 

GAAATATCAG 

TGCTCTCCTG 

GAAATTGTGC 

GATGTGAAGA 

TGTGGTTTCT 

ACCACAGCTG 

TCAGATCTTA 

CTTATATCTA 

ACGCTCCCAC 

GAGTATGCTG 

TTTCAGTATT 

CAAAGCTTGC 

CCGTATGCAT 

GCACTTTCCA 

TGAGATCTCA 

ATTGGAACTA 

CTGGGTTTTG 



Seq ID NO: 25 DNA sequence 

Nucleic Acid Accession #: Eos sequence 



GTGTGTAGTC 
AGTAGGCTCA 
GAAAGTGGTT 
ACTACTTGGC 
ACCCTCTGTG 
CTGTTCCTGG 
CCCCATTCTA 
AAAGAACAGT 



11 

GGGTATGAGA 
GTAGCACGAA 
TAGCATGCTG 
CTCTGGGACT 
AGGAGGATGC 
TAAGGAAATG 
CTTGCAGAAA 
CGTTTCTCCA 



21 
I 

ACGATTGCAA 
CTGCAAATAC 
GAGCTGGTTT 
CAGATTCCTC 
TGTGAGGACA 
AATAACATGA 
GGTTGCTTAC 
AAACTGAAGC 



31 
I 

GCAAAAGCAG 
TTGAGAAGGG 
TCTGCTTGGT 
CTCTTTAAAA 
AAATGAGATC 
AGTGAAGTCT 
ACTGAAAATC 
TGGAAATTAT 



41 
1 

CTGAGGGACG 
AAAAGTTCAG 
AGCCCTACAA 
TGGTGCTAAT 
ATCCACATAA 
TCATCTCCAT 
AGTTTATTTT 
CTGAAATATC 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1S00 



51 
1 

CTGTCCAAGG 
CCTTGTGCTG 
CTTTGGCCCA 
AATAGCACCC 
GCCGTGAACC 
TCCCAACAGT 
CCCCTGGTGC 
AGGTCCTCCG 



60 

120 

180 

240 

300 

360 

420 

480 



236 



WO 03/003906 



PCT/US02/21338 



GAAAAGGGAC GTGAAGCCCC CTTTGTAATT 
GAAACCTCAT CAGAGAAGTC AGCCAAGGAA 
AGCAAATGCA TTAAAAAGTT GCTGACGGGC 
TCCAAGCAGG AAGGATTTTA ACATTTTGAA 
5 TTATTCCATG GGCTAAAATA AATGCAGAAA 
ATCGTGGTGA ATTTCAATGA AAATCCATTT 
AAGGACATGA GAGAAGTAAT ATTGTT GCTT 
TCCTATTTTT AAGAGAAATG TTGAATCTTT 
CTCCTGAAAT AGGAGATAAA TGTTGGAAAG 

10 GGAGTTCAGA GACAGGCAAG CATACATAAG 
AAATGAGTTT CTTACAGTGG TCAATGACAA 
GCAATGGTTT GATGCCAATA TGTTAGCTAT 
AGAAAAATTA AGAATACTCA AGCAGAACCT 
AATGGAGGGA TAGACTGAAG TTAAATGGGA 

15 TGCACAATAA TGCAGATACT CATTTTAAAC 
AATAGAATAA AATACCGATC AAGAATGTGT 
CTCCACAGAA CTTGTCAGAT ATATGACATT 



TCTGCATTAG CGTGCTCTCC TGGCAAGCAG 540 

AGTCTTTAAA TGGAAATTGT GCAAACGAGG 600 

ATGAAATGCT TTGATGTGAA GACGGAAAAC 660 

TCTGATTGAC TCTGTGGTTT CTCAGCACAG 720 

TGGTACTTTC AGACCACAGC TGCAGAGGGG 780 

GAATCTTGAG GTTCAGATCT TAAAAAAGCA 840 

GAAATTTCAT TGCTTATATC TAAAAGAAAC 900 

GCAACGTGGT AGACGCTCCC ACAAAACTTT 960 

AGGCAATGTA TTGAGTATGC TGATAGAGGT 1020 

AGTCAGGATG TTTTTCAGTA TTATCTTTAC 1080 

ACCAATTTCA TTCAAAGCTT GCTTCAATAG 1140 

TTACTTTGAC CACCGTATGC ATTAAAAAGA 1200 

CCAACTTAGA TAGCACTTTC CACAAAAAGT 1260 

TCAGGTATGT GATGAGATCT CAGAAGTGTT 1320 

AGAGTCATAA GGATTGGAAC TAATAAAAAT 1380 

CCTCCTGCGT ATCTGGGTTT TGAATTCTGG 1440 
AAAC 



Seq ID NO: 26 DNA sequence 
20 Nucleic Acid Accession fh NM_0132B2.2 
Coding sequence: 85.. 2466 

1 11 21 31 

„ | I I i 

25 CGACTCCTTA GAGCATGGCA TGGCTCAGAG GTGCTGGTAA 

GTCCCTCCCC TCAGCGCCGA CACCATGTGG ATCCAGGTTC 
ACCCACACGG TGGACTCGCT GTCCAGGCTG ACCAAGGTGG 
CAGGAGCTGT TCCACGTGGA GCCAGGCCTG CAGAGGCTGT 
GAGGACGGCC ATACCCTCTT CGACTACGAG GTCCGCCTGA 
30 GTCCGCCAGA GCCTCGTGCT CCCCCACAGC ACCAAGGAGC 
ACCGACTCCG GCTGCTGCCT GGGCCAGAGT GAGTCAGACA 
GCGGCCGCCG AGACTGACAG CAGGCCAGCC GATGAGGACA 
GGGCTGTACA AGGTCAATGA GTACGTCGAT GCTCGGGACA 
GAGGCGCAGG TGGTCAGGGT GACGCGGAAG GCCCCCTCCC 
35 ACGTCCAGGC CGGCGCTGGA GGAGGACGTC ATTTACCACG 
GAGAACGGCG TGGTCCAGAT GAACTCCAGG GACGTCCGAG 
AAGTGGCAGG ACCTGGAGGT GGGCCAGGTG GTCATGCTCA 
AAGGAGCGGG GCTTCTGGTA CGACGCGGAG ATCTCCAGGA 
CGGGAACTCT ACGCCAACGT GGTGCTGGGG GATGATTCTC 
40 TTCGTGGACG AAGTCTTCAA GATTGAGCGG CCGGGTGAAG 

CCCATGAGAC GGAAGAGCGG GCCGTCCTGC AAGCACTGCA 
TGCCGGGTCT GCGCCTGCCA CCTGTGCGGG GGCCGGCAGG 
TGCGATGAGT GCOACATGGC CTTCCACATC TACTGCCTGG 
CCCAGCGAGG ACGAGTGGTA CTGCCCTGAG TGCCGGAATG 
45 GCGGGAGAGC GGCTGAGAGA GAGCAAGAAG AAGGCGAAGA 
TCACAGCGGG ACTGGGGCAA GGGCATGGCC TGTGTGGGCC 
GTCCCGTCCA ACCACTACGG ACCCATCCCG GGGATCCCCG 
CGAGTCCAGG TCAGCGAGTC GGGTGTCCAT CGGCCCCACG 
AGCAACGACG GAGCGTACTC CCTAGTCCTG GCGGGGGGCT 
50 GGGAATTTTT TCACATACAC GGGTAGTGGT GGTCGAGATC 
GCGGAACAGT CTTGTGATCA GAAACTCACC AACACCAACA 
TTTGCTCCCA TCAATGACCA AGAAGGGGCC GAGGCCAAGG 
GTCAGGGTGG TGCGCAATGT CAAGGGTGGC AAGAATAGCA 
AACCGCTACG ATGGCATCTA CAAGGTTGTG AAATACTGGC 
55 TTTCTCGTGT GGCGCTACCT TCTGCGGAGG GACGATGATG 

GAGGGGAAGG ACCGGATCAA GAAGCTGGGG CTGACCATGC 
GAAGCCCTGG CCAACCGAGA GCGAGAGAAG GAGAACAGCA 
CAGGAGGGGG GCTTCGCGTC CCCCAGGACG GGCAAGGGCA 
GGAGGTGGCC CGAGCAGGGC CGGGTCCCCG CGCCGGACAT 
60 CCCTACAGTC TCACGGCCCA GCAGAGCAGC CTCATCAGAG 
CTGTGGAATG AGGTCCTGGC GTCACTCAAG GACCGGCCGG 
TTGTTCCTGA GTAAAGTGGA GGAGACGTTC CAGTGTATCT 
CGGCCCATCA CGACCGTGTG CCAGCACAAC GTGTGCAAGG 
CGGGCACAGG TGTTCAGCTG CCCTGCCTGC CGCTACGACC 
65 CAGGTGAACC AGCCTCTGCA GACCGTCCTC AACCAGCTCT 

CGGTGATCTC CAAGCACTTC TCGACAGGCG TTTTGCTGAA 
CATCGGCACT GATTTTGTTC TTAGTGGGCT TAACTT AAAC 
CCTAAAAAGG TTTGTCTTCC TTTTTTTTTA TTTTTATTTT 
GAATTTATGT ATTCTGGCTA AAAGTTGGAC TTCTCAGTAT 
70 CATAAAAGCC TGCAATTTCT CGACAAAACA ACACAAGATT 
ACTACGTGGT GTGGAGGCTG TTGATGTTTC TGGTGTCAAG 
CAACTCTTTA AGAAGGCGAC AGGATCAGTC CTTCTCTAGG 
AGCAAGCATC TTCCTGACAG CATTTTGTCA TCTAAAGTCC 
TGGCCCGTGG CAGCCCGTGG CATGGCGTGG CTCAGCTGTC 
75 AAAGAGGAAA CATCTCGGGC CTAGTTCAAA CCTTTGCCTC 

TGCTTAGCGT CTGAGATCCG CGTGAAAAGT CCTCTGCCCA 
CACGCAGAAA TGGCCTCAAG GGGACTCTGC TCCACGTGGG 
TGTCCGACGA AGGCGGCCAC GGACGGACGC CAGCACACGA 



41 
I 

AACTGATGGG 
GGACCATGGA 
AGGAGCTGAG 
TCTACAGGGG 
ATGACACCAT 
GGGACTCCGA 
AGTCCTCCAC 
TGTGGGATGA 
CGAACATGGG 
GGGACGAGCC 
TGAAATACGA 
CGCGCGCCCG 
ACTACAACCC 
AGCGCGAGAC 
TGAACGACTG 
GGAGCCCCAT 
AGGACGACGT 
ACCCCGACAA 
ACCCGCCCCT 
ATGCCAGCGA 
TGGCCTCGGC 
GCACCAAGGA 
TGGGCACCAT 
TGGCTGGCAT 
ATGAGGATGA 
TTTCCGGCAA 
GGGCGCTGGC 
ACTGGCGGTC 
AGTACGCCCC 
CCGAGAAGGG 
AGCCTGGCCC 
AGTATCCAGA 
AGAGGGAGGA 
AGTGGAAGCG 
CCAAGAAAAC 
AGGACAAGAG 
CGAGCGGCAG 
GCTGTCAGGA 
ACTGCCTGGA 
TGGGCCGCAG 
TCCCCGGCTA 
AACGTGTCGG 
AGGTAGTGTT 
TCAAATCTAT 
TGTGTTTAGT 
TTTTAAAGAT 
TTCTCAGAAG 
GTTCTGGCCC 
AGTGACATGG 
TGTTGAAGTT 
AAAGCCATCC 
CGAGAGCAGG 
GCCAGGCGTG 
AGTCACGTGC 



51 



GGTTTTTGCT 
CGGGAGGCAG 
GCGGAAGATC 
CAAACAGATG 
CCAGCTCCTG 
GCTCTCCGAC 
CCACGGCGAG 
GACGGAATTG 
GGCGTGGTTT 
CTGCAGCTCC 
CGACTACCCG 
CACCATCATC 
CGACAACCCC 
CAGGACGGCG 
TCGGATCATC 
GGTTGACAAC 
GAACAGACTC 
GCAGCTCATG 
CAGCAGTGTT 
GGTGGTACTG 
CACATCGTCC 
ATGTACCATC 
GTGGCGGTTC 
ACACGGCCGG 
CGTGGACCAT 
CAAGAGGACC 
TCTCAACTGC 
GGGGAAGCCG 
CGCTGAGGGC 
GAAGTCCGGG 
TTGGACGAAG 
AGGCTACCTG 
GGAGGAGCAG 
GAAGTCGGCA 
CAAGGTGGAG 
CAACGCCAAG 
CCCGTTCCAG 
GCTGGTGTTC 
CAGATCCTTT 
CTATGCCATG 
CGGCAATGGC 
AGGGCTCGTT 
TCCTCCGTTC 
ACATTTTCAG 
TCTTTGAAAA 
GGAATCAGAA 
TTGCTGCCAC 
CCAAGGTCAG 
TTCCCCGTGG 
GTTGCAAGGA 
CCCACCAGAC 
GAGTTGGGGC 
TGACTGACGC 
AAGTGCCTTT 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 



237 



WO 03/003906 
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GATTCGTTCC TTCTTTCTAA AGACGACAGT CTTTGTTGTT AGCACTGAAT TATTGAAAAT 3300 

GTCAACCAGA TTCTAGAAAC TGCGGTCATC CAGTTCTTCC TGACACCGGA TGGGTGCTTG 3360 

GGAACCGTTT GAGCCTTATA GATCATTTAC ATTCAATTTT TTTAACTCAG CAAGTGAGAA 3420 

CTTACAAGAG GGTTTTTTTT TAATTTTTTT TTCTCTTAAT GAACACATTT TCTAAATGAA 3480 

5 TTTTTTTTGT AGTTACTGTA TATGTACCAA GAAAGATATA ACGTTAGGGT TTGGTTGTTT 3540 

TTGTTTTTGT ATTTTTTTTC TTTTGAAAGG GTTTGTTAAT TTTTCTAATT TTACCAAAGT 3600 

TTGCAGCCTA TACCTCAATA AAACAGGGAT ATTTTAAATC ACATACCTGC AGACAAACTG 3660 

GAGCAATGTT ATTTTTAAAG GGTTTTTTTC ACCTCCTTAT TCTTAGATTA TTAATGTATT 3720 

AGGGAAGAAT GAGACAATTT TGTGTAGGCT TTTTCTAAAG TCCAGTACTT TGTCCAGATT 3780 

10 TTAGATTCTC AGAATAAATG TTTTTCACAG ATTGAAAAAA AAAAAAAA 3828 

Seq ID NO: 27 Protein Sequence 
Protein Accession ft: NP_037414.2 

15 1 11 21 31 41 51 

| | I i I I 

MWIQVRTMDG RQTHTVDSLS RLTKVEELRR KIQELPHVEP GLQRLFYRGK QMEDGHTLFD 60 

YEVRLNDTIQ LLVRQSLVLP HSTKERDSEL SDTDSGCCLG QSESDKSSTH GEAAAETDSR 120 

PADEDMWDET ELGLYKVNEY VDARDTNMGA WFEAQWRVT RKAPSRDEPC SSTSRPALEE 180 

20 DVIYHVKYDD YPENGWQMN SRDVRARART IIKWQDLEVG QWMLNYNPD NPKERGFWYD 240 

AEISRKRETR TARELYANW LGDDSLNDCR IIFVDEVFKI ERPGEGSPMV DNPMRRKSGP 300 

SCKHCKDDVN RLCRVCACHL CGGRQDPDKQ LMCDECDMAP HIYCLDPPLS SVPSEDEWYC 360 

PECRNDASEV VLAGERLRES KKKAKMASAT SSSQRDWGKG MACVGRTKEC TIVPSNHYGP 420 

IPGIPVGTMW RFRVQVSESG VHRPHVAGIH GRSNDGAYSIi VLAGGYEDDV DHGNFFTYTG 480 

25 SGGRDLSGNK RTAEQSCDQK LTNTNRALAL NCFAPHJDQE GAEAKDWRSG KPVRWRNVK 540 

GGKNSKYAPA EGNRYDGIYK WKYWPEXGK SGPLVWRYLL RRDDDEPGPW TKEGKDRIKK 600 

LGLTMQYPEG YLEALANRER EKENSKREEE EQQEGGFASP RTGKGKWKRK SAGGGPSRAG 660 

SPRRTSKKTK VEPYSLTAQQ SSLIREDKSN AKLWNEVLAS LKDRPASGSP FQLFLSKVEE 720 

TFQCICCQEL VFRPITTVCQ HNVCKDCLDR SFRAQVFSCP ACRYDLCRSY AMQVNQPLQT 780 

30 VLNQLFPGYG NGR 



Seq ID NOt 28 DNA sequence 
Nucleic Acid Accession ft: NM_000756.1 
35 Coding sequence: 186. .776 

1 11 r i 1 r r 

AGAAACTCAG AGACCAAGTC CATTGAGAGA CTGAGGGGAA AGAGAGGAGA GAAAGAAAAA 60 

40 GAGAGTGGGA ACAGTAAAGA GAAAGGAAGA CAACCTCCAG AGAAAGCCCC CGGAGACGTC 12 
TCTCTGCAGA GAGGCGGCAG CACCCGGCTC ACCTGCGAAG CGCCTGGGAA GCGAGTGCCC 
CTAACATGCG GCTGCCGCTG CTTGTGTCCG CGGGAGTCCT GCTGGTGGCT CTCCTGCCCT 
GCCCGCCATG CAGGGCGCTC CTGAGCCGCG GGCCGGTCCC GGGAGCTCGG CAGGCGCCGC 
AGCACCCTCA GCCCTTGGAT TTCTTCCAGC CGCCGCCGCA GTCCGAGCAG CCCCAGCAQC 

45 CGCAGGCTCG GCCGGTCCTG CTCCGCATGG GAGAGGAGTA CTTCCTCCGC CTGGGGAACC 
TCAACAAGAG CCCGGCCGCT CCCCTTTCGC CCGCCTCCTC GCTCCTCGCC GGAGGCAGCG 
GCAGCCGCCC TTCGCCGGAA CAGGCGACCG CCAACTTTTT CCGCGTGTTG CTGCAGCAGC 540 
TGCTGCTGCC TCGGCGCTCG CTCGACAGCC CCGCGGCTCT CGCGGAGCGC GGCGCTAGGA 60 0 
ATGCCCTCGG CGGCCACCAG GAGGCACCGG AGAGAGAAAG GCGGTCCGAG GAGCCTCCCA 

50 TCTCCCTGGA TCTCACCTTC CACCTCCTCC GGGAAGTCTT GGAAATGGCC AGGGCCGAGC 
AGTTAGCACA GCAAGCTCAC AGCAACAGGA AACTCATGGA GATTATTGGG AAATAAAACG 
GTGCGTTTGG CCAAAAAGAA TCTGCATTTA GCACAAAAAA AATTTAAAAA AATACAGTAT 
TCTGTACCAT AGCGCTGCTC TTATGCCATT TGTTTATTTT TATATAGCTT GAAACATAGA 
GGGAGAGAGG GAGAGAGCCT ATACCCCTTA CTTAGCATGC ACAAAGTGTA TTCACGTGCA 

55 GCAGCAACAC AATGTTATTC GTTTTGTCTA CGTTTAGTTT CCGTTTCCAG GTOTTTATAG 
TGGTGTTTTA AAGAGAATGT AGACCTGTGA GAAAACGTTT TGTTTGAAAA AGCAGACAGA 
AGTCACTCAA TTGTTTTTGT TGTGGTCTGA GCCAAAGAGA ATGCCATTCT CTTGGGTGGG 
TAAGACTAAA TCTGTAAGCT CTTTGAAACA ACTTTCTCTT GTAAACGTTT Cj^TAATAAA 1200 
ACATCTTTCC AGTCCTTGGT CAGTTTGGTT GTGTAAGAGA ATGTTGAATA CTTATATTTT 1260 

60 TAATAAAAGT TGCAAAGGT 

Seq ID NOt 29 Protein Sequence 
Protein Accession 8: NP_000747.1 

65 i ii » » 

LlPLLVSAG VLLVALLPCP PCRALIiSRGP VPGARQAPQH PQPLDFFQPP PQSEQPCJPQ 60 

ARPVLLRMGE EYFLRLGNLN KSPAAPLSPA SSLLAGGSGS RPSPEQATAN FFRVLLQQLL 120 

LPRRSLDSPA ALAERGARNA LGGHQEAPER ERRSEEPPIS LDLTFHLLRE VLEMARAEQL 180 
70 AQQAHSNRKL MEIIGK 

Seq ID NO: 30 DNA sequence 

Nucleic Acid Accession ft: CAT cluster 

75 x xi 2i y. *i si 

Itttcttgag atggctcctc ttggaactat gcatctctcc TQATACTTGG ATGCTTTTCC 60 
TCTGACTGAT QAAQATCCTG AATACCAAAG AGGGCC3CTG AAAGGGCTCT AGGAGTACAC 120 

238 



120 
180 
240 
300 
360 
420 
480 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 



WO 03/003906 
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5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 



CTTCTAGGAA 
CTTTGAAGAG 
TGAGGTTGCT 
CATTTGGTTT 
AGAAATGGGT 
AACTCCCTAA 
AATGTGTTTG 
TGTGCTCAAT 



CCCTAAGCCA 
CAGTCAGACT 
GGGGGAACTC 
GGGTGTCAGT 
TCAAATTTTA 
AATTCAGTTT 
TGAAGATTAA 
AAGTGTTTAT 



GAGAGAGGCT 
CCTGCTTTGC 
TTTTACAACA 
AGCATGATGG 
CCTGCAACTC 
TTTCTTTGTA 
AAAAGTTAAC 
TTATTTATTG 



TCACTACATC 
TCTTCAGACT 
TCTCTCTGAA 
AAAGACAAAA 
ACCAATTCTG 
AAATGGGGTT 
ATAAAGAGTT 
CTGAATAAAC 



ATGCTTCCTG 
TAATTTGGGG 
AGAATCCGGG 
AAACACAACT 
TGGCCTTGGT 
ATGAACAGTA 
TAGAAGAGTG 
CAGTAATTTA 



ACATCTCTCC 180 

GTTTAACAGG 240 

CTGCCAGTTT 300 

TGACATCTGC 360 

TCAGCAATTA 420 

CCTACTTCAA 4 BO 

TCTGGCATAT 540 
ATTAGTAT 



Seq ID NO: 31 DNA sequence 

Nucleic Acid Accession 8: Eos sequence 

Coding sequence: 134.. 1917 



1 
I 

CACCTCAGCC 
TCAGGAGAGT 
AAACCTTATT 
GAGTGTCCCG 
GGACAGACAC 
GGTTGAGGCT 
AGACCCTCCC 
CCAAGCCGCG 



11 
! 

CCAACCCCAA 
GTGGCTCTCC 
GCCATGGTTC 
AGGGCGGTTC 
GGCCCCTCTT 
GGAGGTCAGC 
CCAGGCCTAA 
ATGCCCAGGG 
TCTGCGAGGC 
CCGAGCTCGG 
GGGGAGGGGG 
CGGAAGCTCA 
AAGCCCAGCT 
CTGGGCAGTT 
CAGGAGACCT 
TACCGCCGGC 
CGGGCGGTGC 
CCCAGGGGCC 
GCGCACCTGC 
GAGGAGGCCC 
GGGGGCATCC 
CCTCTGACCA 
CTGGTGGOCC 
TGCTACGACC 
CCTGTGGCCC 
TTCTACCGCC 
GCCAGCCACC 
CTGCTGCGGG 
AGGGCTGCCT 
ACCATTTACT 
CAGTTCCAGG 
TTCATCCTGA 
GAACCAAAGC 
GGGCTCAGGG 
TTCTGCTTTA 
CTCACTCCAC 
GCTTGCTGAC 
TCTCCACAGA 
AAAACATTCA 
ACGCCCTCGC 
GCTATCTGGG 
CTGGTCCGCT 
GGCCAGGCCC 
CAGAAGCCCA 
TAGTTCTGCC 
TGTCTATAAA 
ACAGCTATGA 
TCCCCTTACQ 
CCAACCTCCA 
GCACCACTGG 



21 



31 



CCCCAGCCGC 
CCAGGCGCTA 
TTAGAAGCCA 
CCGGGAGCCC 
CTTCAGTGGG 
CACAGCCAAG 
GAGGAGAGGG 
GCCCCGCACA 
GCTCACAGCC 
AGCAGGAGGT 
GCAGCCCTGC 
TGGTGTTTCT 
CCCGGGCCCT 
GCCTGGACGT 
AGGCGCTGAT 
TGAGCGCGGC 
TGGGCGTCCT 
CTCGTGGCGA 
ATGTGTTCAA 
CGCTTCGGGG 
GTGGCTCCGG 
ACATCTGGAG 
TGGACGGGCT 
CGCGAACAGA 
ACGAGGCTGT 
TGCTCAGGTA 
GGCGTTCCAG 
GCGTGGGCGC 
CCCTGCCCCT 
GCCTCAACCC 
CCAAGGAGCT 
CTCTGCCCCC 
TGCTTCGCTG 
AAGGGGCTGG 
AAGGTTGTCG 
ACCCAGACTG 
ACCCCCCTGT 
CAAGGACTTG 
CACTTCCTAT 
CTCTGGGACT 
TCTGTTTGGC 
GGTGCAGCCC 
ACTCCGGGCT 
CCCCACTTCT 
CCTTTTTAGC 
ATGTGGACAG 
GCACCTCGCA 
CACTTTGAAA 
GCCTGGATGT 
CACCAAATAC 



ATCCCCTGCC 
CCAGGAGGGG 
CCCCTTCCCC 
CGTGGGTCCC 
GACAGTCATA 
AAGCTCCGAG 
AACCAGGGAG 
GCCCCCCGCG 
GGTACCCCAG 
CAGGCCGGCC 
CGGCCGCAGC 
GCAGAGGCCC 
GGAGCCCGCC 
GCTGGCCTTT 
GAGCGACAAC 
CGACCGCGAG 
CGTACTGCCC 
GGAGCCTCCT 
CCCCCGGGAG 
CTGCGGTCTC 
TGCCAAGGCC 
CCAGGTTCGG 
GCTCTATGCC 
CGCCTGGACC 
GGCCTGCCGT 
CAGCCCCGTG 
CGACATCGTG 
CGCCGTGATG 
GCCCGCCCCC 
CCAGGTCACT 
GCAGCCCTTC 
TGAGGACCGG 
CTCTCCAGGG 
GATCGGAACT 
ATTATTTTGA 
TTTCCTGACT 
CTGTGGGACT 
GCTCGCTGGA 
GCTCTGTCAG 
TTCTGCCTGT 
ACTGTGGATT 
TGCTGTCTGC 
CACCACCCTC 
CGCCAACCCC 
TGTGTAGACT 
TGGACGTCTG 
CGGTGTCCAG 
CCCATGCTAG 
CCCTGTCTGG 
ATTTTAATTC 



41 
I 

CCAGCTGACG 
CAGGTCTCAG 
AGGCAAGACA 
AGCACTTCCA 
GGGACAGGTA 
ACCAACGGAT 
AAAAGTCTAG 
CAGAGGCCGC 
CTACGGAAAC 
GCCTCGGGGG 
GGGGCGCTCA 
GGGGGTTGGG 
ACGGCGGCAG 
GCCCAGCAGC 
CTGCTGOGAG 
CGCATCCTCA 
AGCCTCTACC 
GCGGCGGCCC 
AACACCTGGC 
TGCACCATGC 
GTCTGCTCCA 
CCCATGCAGC 
ATCGGTGGCG 
CCACGCGCGC 
GGGGACATCT 
AAGGATGCTT 
GCACTGGGGG 
CGCTACAACA 
GCCCCACTGC 
GCCACCTTCA 
CCCTTGGGGA 
CTGCAGACCT 
AGACCCTCCT 
TCCTGCTCTT 
AGCCCAGACT 
CAATTCCGTA 
CCCTATTCCC 
GCTCTGCTGA 
CAGGACAGGG 
CCTAAGGCCT 
CTCAAGGGCC 
AGCTCCTGCC 
TGCAGCCTTG 
CGATCTCTAA 
TGGACGAGAC 
TCACCCAAGA 
GATGCACAGC 
AAAAGTGAAT 
GCCCTTTTTC 
ACCGAAAGCA 



51 



I 

GGTCAAAGCC 
CCAGCTGGGG 
GGCCCCAAGG 
CACACTCTGA 
CAGGGGGCCT 
CGCCCAGCCC 
ACCCGCTGCC 
CTGGCCCCGC 
GCAGCAGGTG 
ACCCTCAAGG 
CGGAAAAGCA 
GGGTGGTGGA 
CCCTGCGGCG 
ACGGAGAGCC 
TGCTGGGAGA 
GCCTGCGGAC 
AGGGGGGCCG 
CTGTGTCCCT 
GGCCCCTGAC 
ACAACTACCT 
ACGAGGTCTT 
AGGCCCGAGC 
AATGCCTGTA 
CACTCCCCGC 
ACGTCACCGG 
GGGACGAGTG 
GCTTCCTGTA 
CAGTGACCGG 
ACTGCACCAC 
CGGTCTCTGG 
GCACCGGGGT 
CACTCTGAGT 
GGGATGGGCC 
GTTTCTGGAC 
CCCTCAGCCT 
CCTACTTACA 
TAGAGCCAGG 
GCCGAGAGAG 
AGCAAAAACG 
CCCCAGGTAC 
TAGAACCCTT 
CATACCCCCA 
TGGGGCTCTC 
ATGAGGCCTG 
ATTTGACTTC 
GAGTTGTGGG 
ACAATCCATG 
ACATCTGACT 
TGTTTTTTAT 



Seq ID NO: 32 protein Sequence 
Protein Accession &: XP_054631.1 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 



MPRGPAQPPA 
GEGGSPAGRS 
LGSCLDVLAF 
RAVLGVLVLP 
EEAPLHGCGL 
LVALDGLLYA 
FYRLLRYSPV 



11 

i 

QRPPGPAASS 
GALTEKQEBA 
AQQHGEPGLA 
SLYQGGRSGL 
CTMHNYLFLA 
IGGECLYSME 
KDAWDBCPYS 



21 
I 

SARRSQPVPQ 
RKLMVFLQRP 
QETYALMSDN 
PRGPRGEEPP 
GGIRGSGAKA 
CYDPRTDAWT 
ASHRRSSDIV 



31 41 51 

1 I i 

LRKRSRCEIA PSSEQEVRPA ASGDPQGEAP 60 

GGWGWEGPR KPSSRALEPA TAAALRRRLD 120 

LLRVLGDPCL YRRLSAADRE RILSLRTGRG 180 

AAAPVSLPLP AHLHVFNPRE NTWRPLTQVP 240 

VCSNEVFCYN PLTNIWSQVR PMQQARAQLK 300 

PRAPLPAGTF PVAHEAVACR GDIYVTGGHL 360 

AIX3GFLYRFD LLRGVGAAVM RYNTVTGSWS 420 

239 



WO 03/003906 



RAASLPLPAP APLHCTTLGN TIYCLNPQVT ATFTVSGGTA QPQAKELQPF PLGSTGVLSP 480 
FILTLPPEDR LQTSL 



Seq ID NO: 33 DNA sequence 

Nucleic Acid Accession ft: CAT cluster 



1 ii 21 31 41 Si 

| | I I I I 

CCTCTGTCTA ACCTTACATG AAAAAACCCG TTTCCAACGA AGGCCTCTAA AAGGCCAAGA 60 

AATCCCCTTG CAGACTTTAC AAACAGAGTG TTTCCTAACT GCTCTATGAA CAGAAAGGTT 120 

AAACTCTGTG AGTTGAACCA ACACATCACA ACGCAGTTTG TGGGAATGAT TCTGTCTAGT 180 

TTTGAAACGA AGATATTTCC TTTTCTGCCG TTGACCTTAA AGCGCTTGAA ATCTACACTT 240 

GCAAATTGCA CAAATAGAGT GTTTCAAATC TGCTCTGTCT AAGGGAACGT TCAACTCTGT 300 

GAGTTGAATG CACACAACAC AAGGAAGTTA CTGGGAATTC TTCTGTCTAG CCTTACATGA 360 

AAGAAAACCC GTTTCCAACG AAGGCCTCTA AGTGGTCAAA ATATCCACGT GCAGACTTTA 420 

CAAACAGAGT GTTTCCAAAC TGCTGAATGA AAAGAAAAGT AAACTCTGAG AGTTGAACGC 480 

ACACATCGCA GAGCAGTTTC TGAGAATGAT TCTGTCTAGT TTTTATACGA AGATATTTCC 540 

TTTTCTGCCT TTGGCCTCAA AGCGCTTGAA ATCTCCACTT GCAAATTCCA CAAAAAGAGT 600 

GTTTCAAATC TGCTCTGTCT AAAGGAAGGT TCAACTCTGT GAGTTGAATA CACACAACAC 660 

AAAGAAGTTA CTGAGAATTC TTCTGTCTAG CATTATATGA AGAAATCCCG TTTCCAACGA 720 

AGGCATCTAA GAGGTCCAAA TATCCACTTG CAGACTTTAC AAACAGAGGG TTTCCAGAAT 780 
GGCTGTATGA AAAGAAAGGT TAAACTCTGT GAGTTAAACA CACA 



Seq ID NO: 34 DNA sequence 

Nucleic Acid Accession 8: AF011468.1 

Coding sequences 257.. 1468 



60 
120 



1 11 21 31 41 51 

I | | I I I 

GGAAGACTTG GGTCCTTGGG TCGCAGGTGG GAGCCGACGG GTGGGTAGAC CGTGGGGGAT 
ATCTCAGTGG CGGACGAGGA CGGCGGGGAC AAGGGGCGGC TGGTCGGAGT GGCGGAGCGT 

CAAGTCCCCT GTCGGTTCCT CCGTCCCTGA GTGTCCTTGG CGCTGCCTTG TGCCCGCCCA 180 

GCGCCTTTGC ATCCGCTCCT GGGCACCGAG GCGCCCTGTA GGATACTGCT TGTTACTTAT 240 

TACAGCTAGA GGCATCATGG ACCGATCTAA AGAAAACTGC ATTTCAGGAC CTGTTAAGGC 300 

TACAGCTCCA GTTGGAGGTC CAAAACGTGT TCTCGTGACT CAGCAAATTC CTTGTCAGAA 360 

TCCATTACCT GTAAATAGTG GCCAGGCTCA GCGGGTCTTG TGTCCTTCAA ATTCTTCCCA 420 

GCGCGTTCCT TTGCAAGCAC AAAAGCTTGT CTCCAGTCAC AAGCCGGTTC AGAATCAGAA 480 

GCAGAAGCAA TTGCAGGCAA CCAGTGTACC TCATCCTGTC TCCAGGCCAC TGAATAACAC 540 

CCAAAAGAGC AAGCAGCCCC TGCCATCGGC ACCTGAAAAT AATCCTGAGG AGGAACTGGC 600 

ATCAAAACAG AAAAATGAAG AATCAAAAAA GAGGCAGTGG GCTTTGGAAG ACTTTGAAAT 660 

TGGTCGCCCT CTGGGTAAAG GAAAGTTTGG TAATGTTTAT TTGGCAAGAG AAAAGCAAAG 720 

CAAGTTTATT CTGGCTCTTA AAGTGTTATT TAAAGCTCAG CTGGAGAAAG CCGGAGTGGA 780 

GCATCAGCTC AGAAGAGAAG TAGAAATACA GTCCCACCTT CGGCATCCTA ATATTCTTAG 840 

ACTGTATGGT TATTTCCATG ATGCTACCAG AGTCTACCTA ATTCTGGAAT ATGCACCACT 900 

TGGAACAGTT TATAGAGAAC TTCAGAAACT TTCAAAGTTT GATGAGCAGA GAACTGCTAC 960 

TTATATAACA GAATTGGCAA ATGCCCTGTC TTACTGTCAT TCGAAGAGAG TT ATT CAT AG 1020 

AGACATTAAG CCAGAGAACT TACTTCTTGG ATCAGCTGGA GAGCTTAAAA TTGCAGATTT 1080 

TGGGTGGTCA GTACATGCTC CATCTTCCAG GAGGACCACT CTCTGTGGCA CCCTGGACTA 1140 

CCTGCCCCCT GAAATGATTG AAGGTCGGAT GCATGATGAG AAGGTGGATC TCTGGAGCCT 1200 

StTCTT TGCTATGAAT TTTTAGTTGO GAAGCCTCCT TTTGAGGCAA ACACATACCA 1260 

AGAGACCTAC AAAAGAATAT CACGGGTTGA ATTCACATTC CCTGACTTTG TAACAGAGGG 1320 
AG^ctc^AC CTCATTTCAA GACTGTTGAA GCATAATCCC AGCCAGAGGC CAATGCTCAG 1380 
AG^S ScCCCT GGATCACAGC AAATTCATCA AAACCATCAA ATTGCCAAAA 1440 
CAAAGAATCA GCTAGCAAAC AGTCTTAGGA ATCGTGCAGG GGGAGAAATC CTTGAGCCAG 1500 
GGCTGCCATA TAACCTGACA GGAACATGCT ACTGAAGTTT ATTTTACCAT ^GCTGC 1560 
CCTCAATCTA GAACGCTACA CAAGAAATAT TTGTTTTACT CAGCAGGTGT GCOTAACCT 1620 
ScTATTCAG AAAGCTCCAC ATCAATAAAC ATGACACTCT GAAGTGAAAG TAGCCACGAG 1680 
SSS C^tIcTGG TTCATAATCT GGAGGCAAGG TTCGACTGCA GCCGCCCCGT 1740 
CAGCCTGTGC TAGGCATGGT GTCTTCACAG GAGGCAAATC CAGAGCCTGG CTGTGGGGAA 1800 
^nScIcT CTGCCCTGAC CCCGATCAGT TAAGGAGCTG TGCAATAACC TTCCTAGTAC I860 
CTGAGTGAGT GTGTAACTTA TTGGGTTGGC GAAGCCTGGT AAAGCTGTTG GAATGAGTAT 1920 
r TTATTCT TT TTAAGTATGA AAATAAAGAT ATATGTACAG ACTTGTATTT TTTCTCTGGT 1980 
£SSS —CT GTGTGTCTGT CCGGCACCCC GGTAGGCCTG ATTGGGTITC 2040 
TAGTCCTCCT TAACCACTTA TCTCCCATAT GAGAGTGTGA AAAATAGGAA CACGTGCTCT 2100 
KSES Ig^GATTTGC TTGGGATACA GAAGAGGCCA TGTGTCTCAG AGCTGTTAAG 2160 
g^Statttt TTTAAAACAT TGGAGTCATA GCATGTGTGT AAACTTTAAA TATGCAAATA 2220 
AATAAGTATC TATGTCTAAA AAAAAAAAAA AAA 



Seq ID NO: 35 Protein Sequence 
Protein Accession ft: AAC63902.1 



60 
120 



1 " | l I" I" f 

Adrskencis gpvkatapvg gpkrvlvtqq ipcqnplpvn sgqaqrvlcp snssqrvplq 

AQKLVSSHKP VQNQKQKQLQ ATSVPHPVSR PLNNTQKSKQ PLPSAPENNP EEELASKQKN 

Sqhal edfeigrplg kgkfgnvyla rekqskfila lkvlfkaqle kagvehq^ 180 

EVEIQSHLRH PNILRLYGYP HDATRVYLIL EYAPLGTVYR ELQKLSKFDE QRTATYITEL 240 
ANALSYCHSK RVIHRDIKPB NLLLGSAGEL KIADFGWSVH APSSRRTTLC GTLDYLPPEM 300 

240 



WO 03/003906 



IEGRMHDEKV DLWSLGVLCY EFLVGKPPFE ANTYQETYKR ISRVEFTFPD FVTEGARDLI 360 
SRLLKHNPSQ RPMLREVLEH PWITANSSKP SNCQNKESAS KQS 

Seq ID NO: 36 DNA sequence 

Nucleic Acid Accession 8: NM_016267 

Coding sequence: 67.. 843 

1 11 21 31 41 51 

I I I I I I 

CGGCACGAGG ACAGAATCAG GTGATGGTCC AGAATTAAGA GCTGTCACCT GTGTCATTCA 60 

CTCACAATGG AAGAAATGAA GAAGACTGCC ATCCGGCTGC CCAAAGGCAA ACAGAAGCCT 120 

ATAAAGACGG AATGGAATTC CCGGTGTGTC CTTTTCACCT ACTTCCAAGG GGACATCAGC 180 

AGCGTAGTGG ATGAACACTT CTCCAGAGCT CTGAGCAATA TCAAGAGCCC CCAGGAATTG 240 

ACCCCCTCGA GTCAGAGTGA AGGTGTGATG CTGAAAAACG ATGATAGCAT GTCTCCAAAT 300 

CAGTGGCGTT ACTCGTCTCC ATGGACAAAG CCACAACCAG AAGTACCTGT CACAAACCGT 360 

GCCGCCAACT GCAACTTGCA TGTGCCTGGT CCCATGGCTG TGAATCAGTT CTCACCGTCC 420 

CTGGCTAGGA GGGCCTCTGT TCGGCCTGGG GAGCTGTGGC ATTTCTCCTC CCTGGOGGGC 480 

ACCAGCTCCT TAGAGCCTGG CTACTCTCAT CCCTTCCCCG CTCGGCACCT GGTTCCAGAG 540 

CCCCAGCCTG ATGGGAAACG TGAGCCTCTC CTAAGTCTCC TCCAGCAAGA CAGATGCCTA 600 

GCCCGTCCTC AGGAATCTGC CGCCAGGGAG AATGGCAACC CTGGCCAGAT AGCTGGAAGC 660 

ACAGGGTTGC TCTTCAACCT GCCTCCCGGC TCAGTTCACT ATAAGAAACT ATATGTATCT 720 

CGTGGATCTG CCAGTACCAG CCTTCCAAAT GAAACTCTTT CAGAGTTAGA GACACCTGGG 780 

AAATACTCAC TTACACCACC AAACCACTGG GGCCACCCAC ATCGATACCT GCAGCATCTT 840 

TAGTCAAGTT GGAGGAGAAA GACAACACTT GGTCTAAGAC ACGGCAGCAA GACATCCCTG 900 

CATATTGTTC CAGATAAAAA TGAAAGCTGC TCACACCCAC TTGCCTCCCC AATCTGTTAA 960 

ACAGCTTCGT GTCTAGTATG AGCTCAGTAC TTGCCCTGTG AAAATCCCAG AAGCCCCCGC 1020 

TGTCAATGTT CCCCATCCAC ACCCTGCTTG CTCCTGTGTA ACAGCTCAGA TGATGAATAA 1080 
TAATAAAACT GTACTTTTTT GGAAAAAAAA AAAAAAAAAA AAAAAA 



Seq ID NO: 37 Protein Sequence 
Protein Accession NP_057351.1 

1 11 21 31 41 51 

1 ! I I 1 I 

MEEMKKTAIR LPKGKQKPIK TEWNSRCVLF TYPQGDISSV VDEHFSRALS NIKSPQELTP 60 

PSSQSEGVML KNDDSMSPNQ WRYSSPWTKP QPEVPVTNRA ANCNLHVPGP MAVNQFSPSL 120 

AARRASVRPG ELWHFSSLAG TSSLEPGYSH PFPARHLVPE PQPDGKREPL LSLLQQDRCL 180 

ARRPQESAAR ENGNPGQIAG STGLLFNLPP GSVHYKKLYV SRGSASTSLP NETLSELETP 240 
GKYYSLTPPN HWGHPHRYLQ HL 



Seq ID NO: 38 DNA sequence 

Nucleic Acid Accession #: AK058088.1 

Coding sequence: 252.. 1772 



AGGAAACCAA 
GCCCCAGATG 
TAACTCAGAG 
CCTGTGGTGG 
CACTTTCATT 
GCTCAACCAA 
CCCAGATGTG 
TATGCATTTC 
GTGGGGAAGC 
GGCTTTTTCA 
CACACCCCCT 
AAAAATCCCC 
AAGTCGCCTT 
AAGACCTACT 
GGTGCGTGGA 
CCGGCTGCAA 
ACTTGGTTCC 
ACAAAGTGCT 
TGCTTTTGTG 
CCGTGATGAC 
GACGGAGCTT 
ATCTGGAGGT 
TCAACCATTA 
CCCAGGCTGT 
TTGCTCTATA 
GTTTTTACAT 
ACAACCACGT 
TGGTTGACGG 
CCAC GTCAT C 
TAAGTTTTGC 
CCCCCCACTG 



11 
I 

GGCAAGCTCC 
TGATGAGCGC 
GCCAGTGTGA 
GCAAAAGGGC 
CATGATAATG 
CCGGTGGCGT 
CAAGGCTCTG 
ACAAATCCAG 
AGATGTGATT 
GTCTGAGACC 
GAGGGAGCTG 
TGCAAAGAGG 
CGCCACGGCC 
GGGAGTGCTG 
TGTGATGATA 
GTACAAGGAA 
TCTAGGGGGG 
GAAGTCCCCC 
GGTCTTCTTG 
ATTTTTTAAG 
GAGGCCGCTC 
GCTGCGGCAC 
CCACGCACTG 
GAGATTTGGG 
TGGATTCTTC 
GCAGAGAATA 
CAGCCTGCGA 
CAAGTGGCAG 
CTGCAAAAGC 
ATTCATCTTC 
GTCTGCATAA 



21 
I 

CCCTGTCAAA 
TTCCAGGCTT 
TGGGAGTTCC 
TTTGGAACGG 
TOGAACAOGC 
TTGCCCAAAC 
AGCATAGACT 
AAGTTTTTCT 
CTCGAGTGCC 
TTGGCCAAGC 
GAGGAGCTTC 
ATCATCATTT 
CTGAAGAACC 
GCTTCCGCCC 
GCCAGACTCA 
GAGCAGCTCA 
ACGCAGATCC 
AGGTTATTTA 
CAACTGAACT 
AGCTTTCCTG 
TTCCTCTGCT 
CTTAACTTCT 
GAGAATGGGG 
CTGCTCTTTA 
TTTAAGATAA 
AAGCACACAG 
GCGGCACGCC 
GAGTTCAGGA 
CATACCTTGA 
CCAGCATCTT 
AAGAAAATAA 



31 



GCACCTTGGC 
CAGGCTCAGA 
TCCACTCAGC 
TTGCTTGTCT 
ACAAAGCTCG 
AGCCTTTCTC 
TTGAGGAAGC 
TTGAGAATTT 
TGGGCTTCAA 
TCTACCTGAA 
TGCGAGCTCA 
CCTTGAAGAT 
TCTACATGAG 
ACATCCTCCA 
AGCCAAGCAC 
CCACCGGCTG 
ACCTCCACAA 
CCTTTAGTGA 
ACAAGATTCA 
AGAACTGTTG 
TGCGTCTGCA 
TCCCAGAGTC 
GCGACATGGT 
ACCAGGAGAA 
AGGGACTCAA 
ACCTGGAATC 
TGGTGAAGTA 
CAAACCAGAT 
AAATCCAAAC 
GACAGTTTCC 
AATGACATAA 



41 

I 

CCATAAGAAG 
AGGCGCCCCC 
ACACTTCCCC 
TTTCTCTCCT 
GCTGGAACGC 
TGGGGACCTG 
TTTGAGGAAC 
CAAGAACAAG 
ATGGGAGCTC 
AGCCCTGGCG 
ATCACCTAAG 
CAATGACCCA 
TGAGGTGGAG 
GTTCAGTGGC 
CATCAAGAAA 
CGAGAAGTGG 
AATCCCACAG 
ATTCCATCTT 
GGCAATTCCG 
CTTTCTGGAC 
CGGCATCACC 
ATGGCTCGAC 
CCACCTGAAA 
TACAACTTAT 
ACATGATACT 
TCCCTCTGCG 
TGAGATCAGA 
CAAGCAGAAG 
TGTGGGCATC 
AGAAGAATCT 
AAGGGAGC 



51 
I 

AAAAGGGGGA 
AGCTCTCCTG 
TGTAAACACG 
GCGTAATTTC 
CGGGTCACTG 
CTCTCACTTT 
CCAGACAGGT 
QACATCCAAA 
CATCAGCCCC 
CAGGGCACCA 
AAGACCAAAG 
CTGGTCACTA 
ATTAACTTGG 
CTGTTTCAAA 
TTCTACGAGG 
CTGGAAATGA 
GACCTGCTCC 
CTGAAAACAA 
ACTTATGAAA 
CGGGACATAG 
AAAGGCAAGG 
CAGGTTACAG 
GATCTTAACA 
TCGAAAACGA 
ACCTCTTATA 
GTCTACGAGC 
GCAGAGGCCC 
TTTGGGTTGA 
CCAATCTATG 
ATGGGATTTT 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 



241 
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Seq ID NO: 39 Protein Sequence 
Protein Accession fi: BAB7165B.1 



1 11 21 31 41 51 

I I I I I 1 

MIMSNTHKAR LERRVTGSTN RWRLPKQPFS GDLLSLSQMC KALSIDFEEA LRNPDRLCIS 60 

QIQKFFFENF KNKDIQSGBA DVILECLGFK NELHQPRLFQ SETLAKLYLK ALAQGTTHPL 120 

RELEELLRAQ SPKKTKEKSP AKRIIISLKI NDPLVTKVAF ATALKNLYMS EVEINLEDLL 180 

GVLASAHILQ FSGLFQRCVD VMIARLKPST IKKFYEAGCK YKEEQLTTGC EKWLEMNLVP 240 

LGGTQIHLHX IPQDLLHKVL KSPRLFTFSE FHLLKTMLLW VFLQLNYKIQ AIPTYETVMT 300 

FFKSFPENCC FLDRDIGRSL RPLFLCLRLH GITKGKDLEV LRHLNFFPES WLDQVTVNHY 360 

HALENGGDMV HLKDLNTQAV RFGLLFNQEN TTYSKTIALY GFFFKIKGLK HDTTSYSFYM 420 

QRIKHTDLES PSAVYEHNHV SLRAARLVKY EIRAEALVDG KWQEFRTNQI KQKFGLTTSS 480 
CKSHTLKIQT VGIP1YVSFA FIFPAS 



Seq ID NO: 40 DNA sequence 

Nucleic Acid Accession #: NM_032899.1 

Coding sequencer 186.. 1070 

1 11 21 31 41 51 

I I I I II 

GGCACGAGGA TTGCTCAGGA CAGCGGTAAA TCACTTCTTG GAGGTGCCCT GCACGCCGGT 60 

CCTGGGAGCA GGCGGCCTCC CGGGGGTGCG GGAGCCCCAC TCCTCCGTGG TGTGTTCCAT 120 

TTGCTTCCCA CATCTGGAGG AGCTGACGTG CCAGCCTCCC CCAGCACCAC CCAGGGACGG 1BO 

GAGGCATGAG CCGGTCAAGG CACCTGGGCA AAATCCGGAA GCGTCTGGAA GATGTCAAGA 24 0 

GCCAGTGGGT CCGGCCAGCC AGGGCTGACT TTAGTGACAA CGAGAGTGCC CGGCTGGCCA 300 

CGGACGCCCT CTTGGATGGG GGTTCTGAAG CCTACTGGCG GGTGCTCAGC CAGGAAGGCG 360 

AGGTGGACTT CTTGTCCTCG GTGGAGGCCC AGTACATCCA GGCCCAGGCC AGGGAGCCCC 420 

CGTGTCCCCC AGACACCCTG GGAGGGGCGG AAGCAGGCCC TAAGGGACTG GACTCCAGCT 4 B0 

CCCTACAGTC CGGCACCTAC TTCCCTGTGG CCTCAGAGGG CAGCGAGCCG GCCCTACTGC 540 

ACAGCTGGGC CTCAGCTGAG AAGCCCTACC TGAAGGAAAA ATCCAGCGCC ACTGTGTACT 600 

TCCAGACCGT CAAGCACAAC AACATCAGAG ACCTCGTCCG CCGCTGCATC ACCCGGACTA 660 

GCCAGAACAT TTCCATCOGG AGTGTGGAAG GAGAGATATA CTGTGCCAAG TCAGGCAGGA 720 

AATTCACTGG CCAAATCCGG GAGAAGTTCA TCATCTCGGA CTGGAGATTT GTCCTGTCTG 780 

GATCTTACAG CTTCACCTGG CTCTGOGGAC ACGTGCACCG GAACATCCTC TCCAAGTTCA 840 

CAGGCCAGGC GGTGGAGCTG TTTGACGAGG AGTTCCGCCA CCTCTACGCC TCCTCCAAGC 900 

CTGTGATGGG CCTGAAGTCC CCGCGGCTGG TCGCCCCCGT CCOGCCCGGA GCAGCCCCGG 960 

CCAATGGCCG CCTTAGCAGC AGCAGTGGCT CCGCCAGTGA CCGCACGTCC TCCAACCCCT 1020 

TCAGCGGCCG CTCGGCAGGC AGCCACCCCG GTACCCGAAC TGACGGCTGA GATGAGGTTA 1080 

GAATGACTGG GCCCGGCTGA ACATTCCAAA TTGGATTTCA CCATCTGCTG AGAAAGTTTA 1140 

AGGAAGGCAA AGCTTGCCAG GTCACAGAAG CTCCCAAGCC CAGCTTTCCA AAGGCCTCAG 1200 

CCTGTGCCTG TGTCGAGCTC AGTCCTGGGA GATAGGGGAG AACCTGCAGG CAGGAACAAG 1260 

CCCCCCTACT CCTGACCACC CTCCATCAGC AGTCTCCCCT CCGTGGTCGT CTTTGTTGAC 1320 

AAAGGTGCAG TTTCTCCTCT CCTGGGCACC TGTAACATGT GATGCGCTGC CTGCTGGGAG 1380 

GTTAGGTCGG GGCTGCCCCG GCGAGTGGAG CATGAGCAGA ACCGCCGAGG GTCACTTCTG 1440 

GGCAGAAGCT TTGAGAGCCT GGGTCCAGGT TGCCACATAG AAGCAGCTCT CCAGTTGAAA 1500 

CCCTCCTCTG CCAGCCTGGG GTCCTAAGCG ATGAGCAGAA TCCCCCACTC CCACCCCACC 1560 

AACCCACAAT GGATATGTAG TGAGCAAGAA ATAAACCTTT GTTGTTTAAA AAAAAAAAAA 1620 
AAAAAAAA 



Seq ID NO: 41 Protein Sequence 
Protein Accession »: NP_116288.1 

1 11 21 31 41 51 

MSRSRHLGKI RKRLEDVKSQ WVRPARADFS DNESARLATD ALLDGGSEAY WRVLSQEGEV 
DFLSSVEAQY IQAQAREPPC PPDTLGGAEA GPKGLDSSSL QSGTYFPVAS EGSEPALLHS 
WASAEKPYLK EKSSATVYFQ TVKHNNIRDL VRRCITRTSQ NISIRSVEGE IYCAKSGRKF 
TGQIREKFII SDWRPVLSGS YSFTWLCGHV HRNILSKFTG QAVELFDEEF RHLYASSKPV 
MGLKSPRLVA PVPPGAAPAN GRLSSSSGSA SDRTSSNPFS GRSAGSHPGT RTDG 



60 
120 
180 
240 



Seq ID NO: 42 DNA sequence 

Nucleic Acid Accession «t NM_000782.2 

Coding sequence: 405.. 1946 



1 " f I 1 I' I' 

TGGAGAGGGA CAGGAGGAAA CGCAGCGCCA GCAGCATCTC ATCTACCCTC CTTGACACCT 60 
CCCCGTGGCT CCAGCAAGCC CTAGAGGTCA GCCTTGCGGA CCAACAGGAG GACTCCCAGC • ' 
TTTCCCTTTT CAAGAGGTAC CCCAGACACC GGCCACCCTC TTCCAGCCCC TGCGGCCAGT 
GCAAGGAGGC ACCAATGCTC TGAGGCTGTC GCGTGGTGCA GCGTCGAGCA TC CTCGCC GA 
GTCCTTCTGC TGCCTGTCCC GCCTCACCCC GCTCCATCAC ACCAGCTGGC CCTCTTTGCT 
TrCTTTTCCC AGAATCGTTA AGCCCCGACT CCCACTAGCA CCTCGTACCA ACCTCGCCCC 360 
IcSS^OT C^OTTCC CGCGCTCCGG TGTCCCCCGC TGCCATGAGC TCCCCCATCA 420 
GCAAGAGCCG CTCGCTTGCC GCCTTCCTGC AGCAGCTGCG CAGTCCGAGG CAGCCCCCGA 
GACTGGTGAC ATCTACGGCG TACACGTCCC CTCAGCCGCG AGAGGTGCCA GTCTGCCCGC 
TGACAGCTGG TGGCGAGACT CAGAACGCGG CCGCCCTGCC GGGCCCCACC AGCTGGCCAC 
TGCTGGCGAG CCTGCTGCAG ATTCTCTGGA AAGGGGGTCT CAAGAAACAG CACGACACCC 

242 



120 
180 
240 
300 



420 
480 
540 
600 
660 
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PCT7US02/21338 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 



TGGTGGAGTA 
CGGTGCACCT 
AGCGGCTGGA 
TGCTGATCCT 
TGAAACCAGG 
TGGGCAGAAT 
TGAACAAATG 
TCCAGAAGAA 
GCACGTTTGG 
TCTGGCAGGG 
ACAACCGGTT 
ACCAGAATCG 
CGGTGGAAAC 
AAGTGCAACA 
GGGAGGAAGA 
TTACCCCGGG 
ATGCTTTACC 
ACAATTTTGA 
TTAATCCTTT 
TAGCAGAGCT 
CCACAGACAA 
TCCCCATCGC 
TCCAACTCAG 
GGTTTACACT 
TTCTGTCTTG 
TTACAAACCG 
TAAGTTTACA 
TACTGACAAT 
TAATAATGTA 
TCTTCCTGCA 
GGACTTTAAC 
GAGGACCTTA 
GAAGCAATAA 
TGTCTGTGTG 
ATTTTTTTAA 
TTAGTCACAG 
ATAATTTTTC 
ATATAACATT 
TGCACAGAAA 
TGTGCATATT 
ATAAATATTT 
TATATTGTAA 
TTTAGAATTA 
GTAAAAATAA 



CCACAAGAAG 
GGGCTCGCCA 
GATCAAACCG 
GGAAGGGGAA 
GGAAGTGATG 
AGATGAGCTC 
GTCGTTTGAA 
TGCAGGGGAT 
GAGGATGATG 
ACACACTCTG 
AGAGAAGTAT 
GCTTTCAAAG 
GACAGCAAAC 
AAAGCTTCTT 
TTTGAGGAAT 
TGTACCATTT 
CAAAGGAACA 
AGATTCAAGT 
TGCGCATCTT 
TCAACTGCAT 
TGAGCCTGTT 
GTTTTGCCAG 
GGAAGCGGAC 
TCACCTGTGT 
CATCTTCCAT 
TTTCAGGCCC 
TTTCACAATG 
CCATTTACTG 
AGCCAAGCTA 
TTGAAATAAA 
GAAGTCAAAC 
AAATTGTTTA 
TTTTTAAATT 
TGTGTGTCCG 
TATTTGCAGT 
CTTAAAAACA 
TGTCTATTAA 
TGTTTTGTGA 
TGCATGCAAT 
TTTACTACTG 
CTTCCTACAT 
TAAATATGTA 
TAATTCAGGT 
AAGTTTTTAA 



TATGGCAAGA 
TGCCTGCTGG 
TGGAAGGCCT 
GACTGGCAGC 
AAGCTGGACA 
TGTGATGAAA 
AGTATCTGCC 
GAAGCTGTGA 
GTCACTCCAG 
GCCTGGGACA 
TCTCAGCAGC 
AAAGAATTGT 
AGTCTAATGT 
AAGGAAATTC 
ATGCOGTATT 
ACAACTCGGA 
GTGCTCATGC 
CAGTTTAGAC 
CCATTTGGCG 
TTGGCTCTTT 
GAGATGCTAC 
CGATAATACG 
TGAGTGCTGG 
CAGCACCATC 
GACATGAAAG 
TGCTACCACA 
AATGACCTAC 
TAGCTCTGCT 
AATGATGGTT 
AACATTATTG 
AACTTCTAAG 
TTTTTTAAAT 
GTGTATGTGC 
TGTATGTGTG 
ACTTGATTTA 
TTCAGTGTGA 
ATGGGACAAG 
TTTGAACTCC 
ACTTATAAGA 
CTTTTCCTAA 
TATATTTTTA 
CATATCTAGG 
CTTGTCATAT 
ATATT 



TTTTCCGCAT 
AAGCGCTGTA 
ATCGCGACTA 
GGGTCCGGAG 
ACAAAATCAA 
GAGGCCACGT 
TCGTGTTGTA 
ACTTCATCAT 
TCGAGCTGCA 
CCATTTTCAA 
CTAGTGCAGA 
ATGCTGCTGT 
GGATTCTCTA 
AAAGTGTATT 
TAAAAGCCTG 
CTCTTGACAA 
TAAATACCCA 
CTGAACGTTG 
TTGGAAAAAG 
GTTGGATTGT 
ACTCAGGCAC 
CCTCAGATGG 
GATCCAAGGC 
TTCAGGTGCT 
GGAGGCTGGC 
TTCACTGTTT 
AACAGCTAAA 
TAATGTACTA 
AAAGTTATCA 
GGAAACTAGA 
ACTAGTGATT 
GTGATTCCTT 
ATATGAAGCA 
TGTGGGTTCT 
GGATCTGTGG 
CTTTAATATT 
TGTAAAGCAT 
TAGGAAAAAT 
CTTAAAAATT 
ATGCATACTG 
GAATATTTCA 
TATATGCTTT 
TTCATCTGTA 



GAAGTTGGGT 
CCGCACCGAG 
CCGCAAAGAA 
TGCCTTTCAA 
TGAGGTCTTG 
CGAAGACTTG 
TGAGAAGAGA 
GGCCATCAAA 
CAAGAGCCTC 
ATCAGTCAAA 
TTTCCTTTGT 
CACAGAGCTC 
CAATTTATCC 
ACCTGAGAAT 
TCTGAAAGAA 
GGCAACAGTT 
GGTGTTGGGA 
GCTTCAGGAG 
AATGTGCATT 
CCGCAAATAC 
CCTGGTGCCC 
TGGTATTTGC 
ATTCTACAGG 
TAGAATGGCC 
ACTTGTCAGT 
GAATCTTTAA 
TTTTCTGGGG 
CTTAGGAAAA 
GGCCTCCCAT 
GAACACCTCT 
CACTGGGGCA 
TATGGCATTA 
CAGACATGCA 
AATGGTAATT 
TGCAGGGGAT 
ATAAAATGAT 
GCAAAAGTTA 
ATGATTTCAT 
GTGTTTACAG 
TATATAATTC 
GAAATATACA 
CTCTCTGCTG 
TACCTTCAAA 



TCCTTTGAGT 
AGCGTACCCC 
GGCTACGGGC 
AAGAAACTAA 
GCCGATTTTA 
TACAGCGAAC 
TTTGGGCTTC 
ACAATGATGA 
AACACCAAGG 
GCTTGTATCG 
GACATTTATC 
CAGCTGGCTG 
CGTAATCCCC 
CAGAGGCCAC 
TCTATGAGGC 
CTGGGTGAAT 
TCCAGTGAAG 
AAGGAAAAAA 
GGTCGCCGAT 
GACATCCAGG 
AGCCGGGAAC 
TAACATCATA 
GTTCACTGCT 
TGGGAGCCTG 
CAGGTAGAGG 
TTCCCAAGAA 
CTGGGAGTAA 
TGTCCCTGCT 
GAAATTGCGT 
ATTTTTAAAA 
TTATTTGTTA 
GGGTAAAGAT 
TGTGTGTGTG 
TGCCTCAGTC 
GTTTCAAAGT 
TTCCCATGCC 
GAGATCTGTT 
AAATGTAAAA 
ATGGTTTATT 
TGTGTATTTG 
TTTATGTCTT 
TGAAATTATT 
TTCTCTGAAA 



Seq ID NO: 43 Protein Sequence 
Protein Accession #: NP_000773.1 



1 
I 

MSSPISKSRS 
PTSWPLLASL 
TBSVPQRLEI 
VLADPMGRID 
IKTMMSTFGR 
LCDIYHQNRL 
ENQRPREEDL 



KYDIQATDNE 



11 
I 

LAAFLQQLRS 
LQILWKGGLK 
KPWKAYRDYR 
ELCDERGHVE 
MMVTPVELHK 
SKKELYAAVT 
RNMPYLKACL 
SSQFRPERWL 
PVEMLHSGTL 



21 
I 

PRGPPRLVTS 
KQHDTLVEYH 
KEGYGLLILE 
DLYSELNKWS 
SLNTKWfQGH 
ELQLAAVETT 
KESMRLTPGV 
QEKEKINPFA 
VPSRBLPIAF 



31 

I 

TAYTSPQPRE 
KKYGKIFRMK 
GEDWQRVRSA 
FESICLVLYE 
TLAWDTIPKS 
ANSLMWILYN 
PFTTRTLDKA 
HLPPGVGKRM 
CQR 



41 

1 

VPVCPLTAGG 
LGSFESVHLG 
PQKKLMKPGE 
KRFGLLQKNA 
VKACIDNRLE 
LSRNPQVQQK 
TVLGEYAIiPK 
CIGRRLAELQ 



Seq ID NO: 44 DNA sequence 

Nucleic Acid Accession 8: NM_007000.2 

Coding sequence! 1-777 



1 
I 

ATGGCGTCTG 
GTTGTGGGCA 
GTGACAGCCG 
TTCGCTGGTG 
GGTGTGGGTG 
CTCATCGTCT 
ATGGTGTCCA 
GACCAGGGCC 
GGCACATCTG 
GAGGTGGTGT 
AACGAGGAGG 
CACATCGGCC 



11 
I 

CGGCAGCAGC 
ATATCATTAT 
ACCAGTACCG 
CCTGGATTGC 
CCGCACTCTG 
ACATCTTCGA 
ACCCATCCCT 
AGGAGCTGAC 
GTCCCATGGA 
TCCCCTGGCC 
GCTGCOGCCT 
ACGCCATCGA 



21 
I 

GGAGGCCGAG 
TCTGCTGTCA 
TGTATACCCA 
CATCTTCTGC 
CCGCCGCCGG 
GTGCGCCTCC 
GATCACCAAG 
CCGCCTCTGG 
CTGGGTGAAC 
CCCACTGTGC 
GGGGCACATG 
CAGCTACACG 



31 
I 

AAGGGATCTC 
GGCCTGTCCC 
CTGATGGGAG 
GGCTTCTCCT 
TCCATGGTCC 
TGCATCACGT 
CAGATGCTGA 
GACCGCGTCA 
TTCACGTCAG 
TGTCGCCGGA 
GACTACCTGT 
TGGGGTATCT 



41 
I 

CAGTTGTGGT 
TGTTTGCTGA 
TCTCAGGCAA 
TCTTCATGGT 
TCACGTACCT 
CCTACACCCA 
CCTTCTACAG 
TGATTGAGCA 
CCTTCCGGGC 
CGGGAAACTT 
TCACCAAGGG 
CGTGGTTTGG 



720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 



51 
I 

ETQNAAALPG 
SPCLLEALYR 
VMKLDNKINE 
GDEAVNPIMA 
KYSQQPSADF 
LLKBIQSVLP 
GTVLMLNTQV 
LHLALCWIVR 



60 

120 

180 

240 

300 

360 

420 

480 



51 
I 

GGGCCTGCTA 
GACCATATGG 
GGATGACGTC 
AGCCAGTTTT 
GGTGCTCATG 
CCGTGACTAC 
CGCGGACACC 
AGAATGCTGT 
GGCCACTCCG 
CATCCCCCTC 
CTGCTTCGAA 
GTTTGCCATC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



243 



WO 03/003906 



PCT/US02/21338 



CTGATGTGGA CGCTCCCGGT CATGCTGATA GCCATGTATT TCTACACCAT GCTCTGAGGG 780 
ACAGGAGGGG AAGGCAACAT ACACACCCCG GACTCCTCCG CATCCTCCTC CTGCTTCCTC 840 
CGCTGGGCCT GGATGGCTGC CTCACCTCTC ACCTCCCAAC GTCCCTAGCC CTTACGTCCT 900 
TCCACTTCCA AGATCTTTTT CCAGGTTCCT GAGCCCTACT GTGTCTCAGG TGTGCCCTGA 960 
5 AACCCCAGGG CTTGTGTGCA CATATCCTTA GCCCATCTTT CAAGGGACCT CTCCATGATC 1020 
CCACCTCCCA TTCACAGATA CCTCTCTTGT AGCTCTCTGA CCTCCTCCTT CATGGCAGGC 1080 
ATCGCCATTC TTGCTGAACC GTTTGTGATT GCCATTTGAG CTCTGGAAGC CTCTATTGCC 1140 * 
ATGAGAGTTC TGTCACX3GTC ACTTTACTGT CCCCATCATC ACCCAGCACG GGGCTAAGCA 1200 
TATACTAGAT AGTCAATAAA TAA 

10 

Seq ID NO: 45 Protein sequence: 
Protein Accession ft: NPJJ08931.1 

1 11 21 31 41 51 

15 | 1 | I 1 I 

MASAAAAEAE KGSPVWGLL WGNIIILLS GLSLFAETIW VTADQYHVYP LMGVSGKDDV 60 
FAGAWIAIFC GFSFFMVASF GVGAALCRRR SMVLTYLVLM LIVYIFECAS CITSYTHRDY 120 
MVSNPSLITK QMLTFYSADT DQGQELTRLW DRVMIEQECC GTSGPMDWVN FTSAFRAATP 180 
EWFPWPPLC CRRTGNFIPL NEEGCRLGHM DYIiFTKGCFE HIGHAIDSYT WGISWFGFAI 240 
20 LMWTLPVMLI AMYFYTML 



25 



Seq ID NO: 46 DNA sequence 

Nucleic Acid Accession #: NM_006760.1 

Coding sequence : 39.. 593 



1 11 21 31 41 51 

GAAAGCCTGC CAGCACCTAT TCCACCTCCC AGCCCAGCAT GGCACCCCTG CTGCCCATCC 60 

GGACCTTGCC CTTGATCCTG ATTCTGCTGG CTCTGCTGTC CCCAGGGGCT GCAGACTTCA 120 

30 ACATCTCAAG CCTCTCTGGT CTGCTGTCCC CGGCGCTAAC GGAGAGCCTG CTGGTTGCCT 180 

TGCCCCCCTG TCACCTCACA GGAGGCAATG CCACACTGAT GGTCCGGAGA GCCAATGACA 240 

GCAAAGTGGT GACGTCCAGC TTTGTGGTGC CTCCGTGCCG TGGGCGCAGG GAACTGGTGA 300 

GTGTGGTGGA CAGTGGTGCT GGCTTCACAG TCACTCGGCT CAGTGCATAC CAGGTGACAA 360 

ACCTCGTGCC AGGAACCAAA TTCTACATTT CCTACCTAGT GAAGAAGGGG ACAGCCACTG 42 0 

35 AGTCCAGCAG AGAGATCCCA ATGTCCACAC TCCCTCGAAG GAACATGGAA TCCATTGGGC 480 

TGGGTATGGC CCGCACAGGG GGCATGGTGG TCATCACGGT GCTGCTCTCT GTCGCCATGT 540 

TCCTGCTGGT GCTGGGCTTC ATCATTGCCC TGGCACTGGG CTCCCGCAAG TAAGGAGGTC 600 

TGCCCGGAGC AGCAGCTTCT CCAGGAAGCC CAGGGCACCA TCCAGCTCCC CAGCCCACCT 660 

GCTCCCAGGC CCCAGGCCTG TGGCTCCCTT GGTGCCCTCG CCTCCTCCTC CTGCCCTCCT 72 0 

40 CTCCCCTAGA GCCCTCTCCT CCCTCTGTCC CTCTCCTTGC CCCCAGTGCC TCACCTTCCA 780 

ACACTCCATT ATTCCTCTCA CCCCACTCCT GTCAGAGTTG ACTTTCCTCC CATTTTACCA 840 

CTTTAAACAC CCCCATAACA ATTCCCCCAT CCTTCAGTGA ACTAAGTCCC TATAATAAAG 900 
GCTGAGGCTG CATCTGCCAA AAAAAAAAAA AA 

45 Seq ID NO: 47 Protein Sequence 

Protein Accession ft: NP_006751.1 

1 11 21 r i 1 r 

50 MAPLLPIRTL PLILILIiALL SPGAADFNIS SLSGLLSPAL TESLLVALPP CHLTGGNATL 60 
MVRRANDSKV VTSSFWPPC RGRRELVSW DSGAGFTVTR LSAYQVTNLV PGTKFYISYL 120 
VKKGTATBSS REIPMSTLPR RNMESIGLGM ARTGGMWIT VLLSVAMFLL VLGFIIALAL 163 
GSRX 

55 Seq ID NO: 48 DNA sequence 

Nucleic Acid Accession ft: EOS sequence 
Coding sequence: 200-2932 

1 11 21 31 41 51 

^ ATTGCTGATG GATCAGTGAG CCTGTGTTCA TGCCAGTGAG CTGCTGTGGC TCAGATACTG 
ATACTTTCTT TCCAAACAGC ATAAGAAGTG ATTGAGCCAC AAGTATACTG AAGGAAGGGC 
TCCCTCGAGT TCTGGTGTGA AGAGATAAAT CACCAGTCAC AGACTATGCA CCCGACTGCT 180 
GCTGTTCAGT CCAGGGAAAA TGAAAGTTGG AGTGCTGTGG CTCATTTCTT TCTTCACCTT 240 

65 CACTGACGGC CACGGTGGCT TCCTGGGGAA AAATGATGGC ATCAAAACAA AAAAAGAACT 300 

CATTGTGAAT AAGAAAAAAC ATCTAGGCCC AGTCGAAGAA TATCAGCTGC TGCTTCAGGT 360 
GACCTATAGA GATTCCAAGG AGAAAAGAGA TTTGAGAAAT TTTCTGAAGC TCTTGAAGCC 42b 
TCCATTATTA TGGTCACATG GGCTAATTAG AATTATCAGA GCAAAGGCTA C CACA GACTG 
CAACAGCCTG AATGGAGTCC TGCAGTGTAC CTGTGAAGAC AGCTACACCT GGTTTCCTCC 

70 CTCATGCCTT GATCCCCAGA ACTGCTACCT TCACACGGCT GGAGCACTCC CAAGCTGTGA 
ATGTCATCTC AACAACCTCA GCCAGAGTGT CAATTTCTGT GAGAGAACAA AGATTTGGGG 
CACTTTCAAA ATTAATGAAA GGTTTACAAA TGACCTTTTG AATTCATCTT CTGCTATATA 
CTCCAAATAT GCAAATGGAA TTGAAATTCA ACTTAAAAAA GCATATGAAA GAATTCAAGG 
TTTTGAGTCG GTTCAGGTCA CCCAATTTCG AAATGGAAGC ATCGTTGCTG GGTATGAAGT 

75 TGTTGGCTCC AGCAGTGCAT CTGAACTGCT GTCAGCCATT GAACATGTTG CCGAGAAGGC 
TAAGACAGCC CTTCACAAGC TGTTTCCATT AGAAGACGGC TCTTTCAGAG TGTTCGGAAA 
AGCCCAGTGT AATGACATTG TCTTTGGATT TGGGTCCAAG GATGATGAAT ATACCCTGCC 
CTGCAGCAGT GGCTACAGGG GAAACATCAC AGCCAAGTGT GAGTCCTCTG GGTGGCAGGT 

244 



60 
120 



420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



WO 03/003906 



PCT/US02/21338 



5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 



CATCAGGGAG 
TGTAGGCAAT 
TCGGCAAAAC 
TATTTCATCT 
CATCAGTATA 
GCGGGAAGAA 
GGTGCCTCCG 
TCCAGTGAAC 
AAATACATCT 
CCTTCCAGAA 
CAAAAATGGA 
CATAAATGAA 

AACTCAAGAC 
ACCTTTTGTC 
TATCTCCATT 
TAAAAAAAGC 
CTTGATTGCT 
TGGAGTCTGC 
GATGCTCATG 
CCAGCATTTG 
TGTCATTACC 
GCTTAACTGG 
TGTGGCTGTG 
TGGGGAAAGA 
CATTCTGACC 
CCAGAATCTG 
CTTATGCTTT 
TGCCTTAAGT 
CAAATTCTCA 
TGGAGATTCC 
ATCATAAAAT 
AGAAATTATG 
AATGTCTCAG 



ACTTGTGTGC 
GCCACTGAGG 
CCATCAACCA 
CTGTCACTGG 
GCTGACAATA 
AAGTATGCCA 
ACAGCTCTTC 
AAAAGCCAAC 
ATTCCCATCA 
ACTATTATCA 
AATGCTCAGG 
GTTTTCCTAT 
GATTTCAGTC 
ATCGTGACGT 
CCCTCTACAA 
GGAAGTCTCA 
CAAACCTCTC 
GATGTCTGGT 
ACAGCTGCTG 
CTTGGCATCC 
ATGATGGCTG 
ATTGCTGTCA 
TCCAATGGAA 
AACTTCGTTG 
CTGAGTCGGG 
CCTCTGCTAG 
GCTTGGCATG 
GGAATACTCT 
TCTTGGAAGC 
AAGCCTTTCA 
TCCGACAACA 
CAAGAAAAAA 
CTCAGTATTC 
TGGCTTCA 



TCTCTCTGCT 
CAGCTGTGTC 
CAGTGGGGAA 
CCAGCCATTT 
TCCTTAATTC 
GCTCACGGTT 
CTCTGAATTT 
TCAAAAGGGG 
GAGGCCGTGT 
GCATGGCCTC 
TCAATGGACC 
TTTTTTCCAA 
ATTTGCAGTG 
GCCAATGTAC 
TCTTCCCCGT 
TTTTATGCCT 
ACACACGTCG 
TTATTGTTGG 
TGTTCTTTAC 
TGCTGGCTTA 
TTGGATTTTG 
CGCAACCTAG 
GCAAACCACT 
TGGTGCTGCT 
ATGACAAGGC 
GGCTCACCTG 
TTATTTTTGC 
TGGACAGTAA 
AAACAGAAAA 
ACCCACTGCA 
TCATGCTAAC 
TTTCCAGAAC 
GATCGGGTTT 



TGAAGAACTG 
ATCCTTOGTG 
TCTGGCTTCG 
CAGGGTGTCC 
AGCCTCAGTA 
ACTAGAGACA 
TTCTCGGAAA 
TTACAGCTAT 
GTTAATTGGG 
GTTGACTCTG 
TGTGATATCC 
GATAGAGTCA 
GAACGATGCA 
TCACTTGACC 
TGTAAAATGG 
GATCATCGAG 
TATTTGCATG 
TGCCACAGTG 
ACACTTCTTC 
CCGGATCATC 
CCTGGGTTAT 
CAATACCTAC 
CCTGGCTTTT 
AGTTCTCACA 
CACCATCATC 
GGGCTTTGGA 
TTTACTCAAT 
GCTGCGACAA 
GCAAAACTCA 
AAACAAAGGC 
TCAGTTTGTC 
AACTTGACAT 
TCTGATTTAG 



AACAAGAATT 
CAAAATCTTT 
GTGGTGTCGA 
AATTCAACAA 
ACCAACTGGA 
TTAGAAAACA 
TTCATTGACT 
CAGATTAAAA 
TCAGACCAAT 
GGGAACATTC 
ACGGTTATTC 
AACCTGAGCC 
GGCTGCCACC 
TCCTTCTCCA 
ATCACCTATG 
GCTTTGTTTT 
GTGAACATAG 
GACACCACGG 
TACCTCTCTT 
CTCGTGTTCC 
GGGTGCCCTC 
AAAAGGAAAG 
GTTGTCCCTG 
AAGCTCTGGA 
CGCGTGGGGA 
ATAGGAACAA 
GCATTCCAGG 
CTTCTGTTCA 
TCAGATTTAT 
CATTATGCAT 
TCAAATGAAT 
TTAGAGACAA 
GGGTCTGGGA 



TCAGTATGAT 
CTGTCATCAT 
TTCTGAGCAA 
TGGAGGATGT 
CAGTCTTACT 
TCAGCACTCT 
GGAAAGGGAT 
TGTGTCCCCA 
TCCAGAGATC 
TACCCGTTTC 
AAAACTATTC 
AGCCTCATTG 
TAGTGAATGA 
TATTGATGTC 
TGGGACTGGG 
GGAAGCAGAT 
CCCTGTCCCT 
TGAACCCTTC 
TGTTCTTCTG 
ATCACATGGC 
TCATTATATC 
ATGTGTGTTG 
CACTGGCTAT 
GGCCGACTGT 
AGAGCCTCCT 
TAGTGGACAG 
GATTTTTTAT 
ACAAGTTGTC 
CTGCCAAACC 
TTTCTCATAC 
AAGGCAAGGA 
ATGTCAATGA 
ATAAAACAAG 



1140 

1200 

1260 

1320 

13B0 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 



Seq ID NO: 49 Protein sequence: 
Protein Accession #: EOS sequence 

1 11 21 31 

j 1 ) I 

MKVGVLWLIS FPTPTDGHGG FLGKNDGIKT KKELIVNKKK 
EKRDLRNFLK LLKPPLLWSH GLIRIIRAKA TTDCNSLNGV 
NCYLHTAGAL PSCECHLNNL SQSVNPCERT KIWGTFKINE 
IEIQLKKAYE RIQGFESVQV TQFRNGSIVA GYEWGSSSA 
LFPLEDGSFR VFGKAQCNDI VFGFGSKDDE YTLPCSSGYR 
LSLLEELNKN FSMIVGNATE AAVSSFVQNL SVIIRQMPST 
ASHFRVSNST MEDVISIADN ILNSASVTNW TVLLREEKYA 
PLNFSRKFID WKGIPVNKSQ LKRGYSYQIK MCPQNTSIPI 
SMASLTLGNI LPVSKNGNAQ VNGPVISTVI QNYSINEVFL 
HLQWNDAGCH LVNETQDIVT CQCTHLTSFS ILMSPFVPST 
ILCLI IEALF WKQIKKSQTS HTRRICMVNI ALSLLIADVW 
VFFTHFFYLS LFFWMLMLGI LLAYRI ILVF HHMAQHLMMA 
TQPSNTYKRK DVCWLNViSNG SKPLLAFWP AIAIVAVNFV 
DDKAT1IRVG KSLLILTPLL GLTWGFGIGT IVDSQNLAWH 
LDSKLRQLLF NKLSALSSWK QTEKQNSSDL SAKPKFSKPF 
IMLTQFVSNE 

Seq ID NO: 50 DNA sequence 

Nucleic Acid Accession 8: EOS sequence 

Coding sequence: 63-3224 



41 
I 

HLGPVEEYQL 
IiQCTCEDSYT 
RFTNDLLNSS 
SELLS AIBHV 
GNITAKCESS 
TVGNLASWS 
SSRLLETLEN 
RGRVLIGSDQ 
FFSKIESNLS 
IFPWKWITY 
FIVGATVDTT 
VGFCLGYGCP 
WLLVLTKLW 
VIFALLNAFQ 
NPLQNKGHYA 



51 
I 

LLQVTYRDSK 
WFPPSCLDPQ 
SAIYSKYANG 
AEKAKTALKK 
GWQVIRETCV 
ILSNISSLSL 
ISTLVPPTAL 
FQRSLPETII 
QPHCVFWDFS 
VGLGISIGSL 
VNPSGVCTAA 
LIISVITIAV 
RPTVGERLSR 
GFFILCFGIL 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



CAGGAATTTG 
CCATGTCGAG 
AGCAGCAGCC 
CGGGGGAGCT 
AAGCCACAGT 
AGTTCTTCAG 
TCCTCATCAT 
TGAGCCACCA 
CAGAGGACAT 
CCCTGCCACC 
ATGTGACCTT 
CACTCCCTGC 
GTCTGCAGCA 
ACATGTGGGC 
TGGGCAGCCA 



11 
I 

TGGCGGAGAG 
GCGCAAGCAG 
GCAGCAGCAG 
GGGTGCTCCA 
AAAGCGGCTT 
CATCTCTGAG 
GAATGACAGC 
GCCCACCAGT 
GAAGGAGAAG 
CACCCCCCAG 
GCAGGCACTA 

ccccgtgcct 

GCAGCAGCTA 
CTCCCACGCC 
CATGTCTCAG 



21 
I 

GGCAAATAAC 
GCGAAACCCC 
ACCCCGGAGT 
GTGAACCACC 
CGTCGGGAGG 
TTCCTGGAAC 
GAGGGGCCTG 
CCOGGCAGTA 
CCGGATGCGG 
GACATAAGCT 
CGGGGCACCA 
GGTGCCAACA 
CAGCAGATCC 
CTCCACrCAA 
CAGGTTTCTG 



31 
I 

TGCGGCTCTC 
AGCACATCAA 
TTGCAGATGC 
CAGGGAATGA 
AGACGCACGT 
ATAAGAAAAA 
TGCCTTCAGA 
AGGACTGTCA 
AGTCTGTGGT 
ATTTAGCCAA 
AGGTGGCGGT 
GCATCCOGTG 
AGCTCACCGA 
GCGGGGCAGG 
CAGCTGTGGC 



41 
I 

CCGGCGCCCC 
CTCGGAGGAG 
GGCCCCAGCJG 
CGAGGTGGCG 
CTGTGAGAAA 
TTGCACTAAA 
AGACTTCTCC 
CAGGGAGAAT 
GTACCTAAAG 
AGGCAAAGTG 
GAATCAGCGG 
GGTCCTCGAG 
GCAGATCCGC 
GGCCGACACT 
TTTGCTCAGC 



51 
I 

GATGCTCGCA 
GACCAGGGCG 
GCGCCCGCGG 
AGTGAGGATG 
TGCTGTGCGG 
AATCCACCTG 
GGAGCTGTAC 
GGCGGCAGCT 
ACAGAGACAG 
GCCAACACTA 
AGCGCGGATG 
CAGATCTTGT 
ATCCAGGTGA 
CTGAAGACCT 
CAGAAAGCTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



245 



WO 03/003906 



PCTAJS02/21338 



GAAGCCAAGG TCTGTCTCTG GATGCCTTGA 
CTTCTGCCAC CAGCTCCCTG TCCCCAGGGC 
CCCGGGTGCT CCCGAACGTC ATGTCCCGCC 
GCTCGGTGCT CTTCCAGAGC CCTTTCTCCA 
5 AGGGGAAGCC ACCGAACATC TCCGCGGTGG 
ACAAGCACAA GTGTAAGTAC TGTAGCAAGG 
ACCTCCGCTC CCACACTGGA GAGAGACCCT 
CCACCAAGGG CAACCTCAAG GTGCACTTTC 
AGCTGTTTGC CGAGTTCCAG GACAAAGTGG 

10 CTGTACCTGA CCCCATAGAT GAACCGAGTC 
CCACCTCTGT AGGGCTACCT CAGAATCTTT 
GTGGCTCCTT GCCCGGTGAC CTGCAGCCTG 
CACTCCCTGG GGTGGGACCA AACTATAATT 
GGACCCCTGA GCCAGGGTCA GAGACCCTGA 

15 AGGCCACCAC TGATCCCAAC GAATGTCTCA 
CCCTCAAGAT GCATTATCGC ACCCACACCG 
GCCGAGCCTT TTCTACCAAA GGTAACCTGA 
CATCCATTAA GACGCAGCAT TCGTGCCCCA 
TGCTGCAGCA ACATATTCGG ATGCACATGG 

20 AGAATCCCTG TGACTTTACG GGTTCTGAGC 
GCGCTATCTG CCATGATGAT GTCATCGAAA 
AGGCTCCCAG CAGCTCCTCC AAGGTCCCCA 
CCACGCTAGG GTTTGCCATG ATGGCTTCCT 
CTTTTAACCT GCAGCGCCAG GGCAGCAGAG 

25 CCAACGACTC ATCCTCGCTG ATGGGAGACC 
TGGAAACCAC ATCCTTCCAG GCACTCTCCC 
CAAAGTCTCC CGATGCTGGG AGCAAAGCAG 
AAGGTOGGAG CAGTCTCCCT TCCACGTTTA 
AAGTTCCTGG CACATTTGTG GGACCCTCGA 

30 CAGCCCAGCC ACGCCGACAG GCCAAGCAAC 
CGTCTGCTAG CGCTCTTCAG ATCCACGAGC 
GCAACATTTG TGGGCGAGCT TTTACCACCA 
ACGGGGCGAA CAATAACTCA GCCCGCCGTG 
CTCTGTTAGG TACGGACGGA AAAAGAGTCT 

35 CTTCAGTGAA TGTGGACCCT GTTGTGTGGA 
TGGCCGTGAA GACCAATGAG ATCTCTGTGA 
TTTCCTTGGG GGCCACCTCC GTTGTGAATA 
AGTCGGGTAT CAGTGCAGAT GTGGAAAAAC 
AGTTTCCTCA CTTCCTGGAA GAAAACAAGA 

40 AAGGAGCAAT GCAGACACAG TGAAATCTCT 
TCCTCCTGTT TTCTTTTTCT TACTGATATG 
ACAACCTCAG GCAAGTGCTA CAATCACGAT 



AACAAGCCAA GCTACCTCAC GCCAACATCC 960 
TGGCACCCTT CACTCTGAAG CCGGATGGGA 1020 
TCCCGAGCGC TTTGCTTCCT CAGGCCCCGG 1080 
CTGTGGCGCT AGACACATCC AAGAAAGGGA 1140 
ATGTCAAACC CAAAGACGAG GCGGCCCTCT 1200 
TTTTTGGGAC TGATAGCTCC TTGCAGATCC 1260 
TCGTGTGCTC TGTCTGTGGT CATCGCTTCA 1320 
ACCGACATCC CCAGGTGAAG GCAAACCCCC 1380 
CGGCCGGCAA TGGCATCCCC TATGCACTCT 1440 
TTTCTTTAGA CAGCAAACCT GTCCTTGTAA 1500 
CTTCGGGGAC TAATCCCAAG GACCTCACGG 1560 
GGCCTTCTCC AGAAAGTGAG GGTGGACCCA 1620 
CCCCAAGGGC TGGTGGCTTC CAAGGGAGTG 1680 
AATTGCAGCA GTTGGTGGAG AACATTGACA 1740 
TTTGCCACCG AGTCTTAAGC TGTCAGAGCT 1800 
GGGAGAGACC GTTCCAGTGT AAGATCTGTG 1860 
AGACACACCT TGGGGTTCAC CGAACCAACA 1920 
TCTGCCAGAA GAAGTTCACT AATGCCGTGA 1980 
GCGGTCAGAT TCCCAACACG CCCCTGCCAG 2040 
CAATGACCGT GGGTGAGAAC GGCAGCACCG 2100 
GCATCGATGT AGAGGAAGTC AGCTCCCAGG 2160 
CGCCTCTTCC CAGCATCCAC TCGGCATCAC 2220 
TAGATGCCCC AGGGAAAGTG GGTCCTGCCC 2280 
AAAACGGTTC CGTGGAGAGC GATGGCTTGA 2340 
AGGAGTATCA GAGCCGAAGC CCAGATATCC 2400 
CGGCCAATAG TCAAGCCGAA AGCATCAAGT 2460 
AGAGCTCCGA GAACAGCCGC ACTGAGATGG 2520 
TCCGAGCCCC GCCGACCTAT GTCAAGGTTG 2580 
CATTGTCCCC AGGGATGACC CCTTTGTTAG 2640 
ATGGCTGCAC ACGGTGTGGG AAGAACTTCT 2700 
GGACTCACAC TGGAGAGAAG CCTTTTGTGT 2760 
AAGGCAACTT AAAGGTTCAC TACATGACAC 2820 
GAAGGAAGTT GGCCATCGAG AACACCATGG 2880 
CAGAAATCTT TCCCAAGGAA ATCCTGGCCC 2940 
ACCAGTACAC CAGCATGCTC AATGGCGGTC 3000 
TCCAGAGTGG GGGGGTTCCT ACCCTCCCGG 3060 
ACGCCACTGT CTCCAAGATG GATGGCTCCC 3120 
CAAGTGCTAC TGACGGCGTT CCCAAACACC 3180 
TTGCGGTCAG CTAAGGGAGA ACTTGCGTGG 3240 
AGAATCTGCT TTGTTTTGTA AGAACTCATC 3300 
CAAATGATGT TTACTACGTT GGTTGTGACC 3360 
TGTTGCTATG CTGCTTTGCA AAAAGTTG 



Seq ID NO: 51 Protein sequence: 
45 Protein Accession #: NP_06S169.1 

1 11 i 1 r i 1 f 

MSRRKQAKPQ HTNSEEDQGE QQPQQQTPEF ADAAPAAPAA GELGAPVNHP GNDEVASEDE 60 

50 ATVKRLRREE THVCEKCCAE FFSISEPLEH KKNCTKNPPV LIMNDSEGPV PSEDFSGAVL 120 

SHOPTSPGSK DCHRENGGSS EDMKEKPDAE SWYLKTETA LPPTPQDISY LAKGKVANTN 180 

VTLQALRGTK VAVNQRSADA LPAPVPGANS IPWVLEQIUC LQQQQU2QIQ LTEQIRIQVN 240 

MWASHALHSS GAGADTLKTL GSHMSQQVSA AVALLSQKAG SQGLSLDALK QAKLPHANIP 300 

SATSSLSPGL APFTLKPDGT RVLPNVMSRL PSALLPQAPG SVLFQSPPST VALDTSKKGK 360 

55 GKPPNISAVD VKPKDEAALY KHKCKYCSKV FGTDSSLQIH LRSHTGERPF VCSVCG^FT 420 

TKGNL.KVHFH RHPQVKANPQ LFAEFQDKVA AGNGIPYALS VPDPIDEPSL SLDSKPVLVT 480 

TSVGLPQNLS SGTNPKDLTG GSLPGDLQPG PSPBSEGGPT LPGVGPNYNS PRAGGFQGSG 540 

TPEPGSETLK LQQLVENIDK ATTDPNECLI CHRVLSCQSS LKMHYRTHTG ERPFQCKICG 600 

RAPSTKGNLK THLGVHRTNT SIKTQHSCPI CQKKFTNAVM LQQHIRMHMG GQIPNTPLPE 660 

60 NPCDFTGSEP MTVGENGSTG AICHDDVIES IDVEEVSSQE APSSSSKVPT PLPSIHSASP 720 

TLGFAMMASL DAPGKVGPAP FNIjQRQGSRE NGSVESDGLT NDSSSLMGDQ EYQ^SPDIL 780 

ETTSFQALSP ANSQAESIKS KSPDAGSKAE SSENSRTEME GRSSLPSTPI RAPPTYVKVE 840 

VPGTFVGPST LSPGMTPIjLA AQPRRQAKQH GCTRCGKNFS SASALQIHER THTGEKPFVC 900 

NICGRAFTTK GNLKVHYMTH GANNNSARRG RKLAIENTMA LLGTDGKRVS EIFPKEIUP 960 

65 SVNVDPWWN QYTSMLNGGL AVKTNEISVI QSGGVPTLPV SLGATSWNN ATVSKMDGSQ 1020 
SG1SADVEKP SATDGVPKHQ FPHFLEENKI AVS 



70 



Seq ID NO: 52 DNA sequence 

Nucleic Acid Accession ft: CAT cluster 



60 

120 

180 



11 21 31 |1 51 

TCTAGCATTA aaggaagaaa ccccgtcttg CAACGAAGGC ctcaaagagg tccaaatatc 

AACTTGCAGA CATAACAAGC AGAGTGTTTC TAAACTGCTC TAAGAAAAGA AAGGTTAAAC 
75 TCTGTGAGTT GAAGGCACAC ATCACAAAGT AGTTTCTGAG AATGATTCTG TCTAGTTTTT 

ATTTGAAGAT ATTTCCTTTT CTACTGTTGG CATCAAATCG CTTGAAATCT CCACTTGCAA 240 
ATTCCACAAA AAGAGTGTTT CAAATCTGCT CTGTGTAAAG GGACGTTCCA CTCTGTGAGT 300 
TGAATACACA CAACACAAGG AAGTTACTGA GAATTCTTCT GTCTAGCATG ATATGAAGAA 3 60 

246 
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ATCCCGTTTC CAACGAAGGC CTCAATGAGG TCTATATATC CACTTGCAGA CTTTACAAAC 420 

AGAGTGTTTC CTAACTGCTC TATGAAAAGA AAGGTTAAAC TCTGTGGAGT TGAACGCACA 480 

CATCACAAAA GGAGTTTCTG AGAATCATTC TGTCTAGTTT TTTTAGGAAG ATATTTCCTT S40 
TTCTACCGTT GACTTCAAAG CGGCTGAAAT 

Seq ID NO: 53 DNA sequence 

Nucleic Acid Accession #: NM_001910.1 

Coding sequence: 50.. 1240 

1 11 21 31 41 51 

I I I I I I 

GGAGAGAAGA AAGGAGGGGG CAAGGGAGAA GCTGCTGGTC GGACTCACAA TGAAAACGCT 60 

CCTTCTTTTG CTGCTGGTGC TCCTGGAGCT GGGAGAGGCC CAAGGATCCC TTCACAGGGT 120 

GCCCCTCAGG AGGCATCCGT CCCTCAAGAA GAAGCTGCGG GCACGGAGCC AGCTCTCTGA 180 

GTTCTGGAAA TCCCATAATT TGGACATGAT CCAGTTCACC GAGTCCTGCT CAATGGACCA 240 

GAGTGCCAAG GAACCCCTCA TCAACTACTT GGATATGGAA TACTTCGGCA CTATCTCCAT 300 

TGGCTCCCCA CCACAGAACT TCACTGTCAT CTTCGACACT GGCTCCTCCA ACCTCTGGGT 3 60 

CCCCTCTGTG TACTGCACTA GCCCAGCCTG CAAGACGCAC AGCAGGTTCC AGCCTTCCCA 420 

GTCCAGCACA TACAGCCAGC CAGGTCAATC TTTCTCCATT CAGTATGGAA CCGGGAGCTT 480 

GTCCGGGATC ATTGGAGCCG ACCAAGTCTC TGTGGAAGGA CTAACCGTGG TTGGCCAGCA 540 

GTTTGGAGAA AGTGTCACAG AGCCAGGCCA GACCTTTGTG GATGCAGAGT TTGATGGAAT 600 

TCTGGGCCTG GGATACCCCT CCTTGGCTGT GGGAGGAGTG ACTCCAGTAT TTGACAACAT 660 

GATGGCTCAG AACCTGGTGG ACTTGCCGAT GTTTTCTGTC TACATGAGCA GTAACCCAGA 720 

AGGTGGTGCG GGGAGCGAGC TGATTTTTGG AGGCTAOGAC CACTCCCATT TCTCTGGGAG 780 

CCTGAATTGG GTCCCAGTCA CCAAGCAAGC TTACTGGCAG ATTGCACTGG ATAACATCCA 840 

GGTGGGAGGC ACTGTTATGT TCTGCTCCGA GGGCTGCCAG GCCATTGTGG ACACAGGGAC 900 

TTCCCTCATC ACTGGCCCTT CCGACAAGAT TAAGCAGCTG CAAAACGCCA TTGGGGCAGC 960 

CCCCGTGGAT GGAGAATATG CTGTGGAGTG TGCCAACCTT AACGTCATGC CGGATGTCAC 1020 

CTTCACCATT AACGGAGTCC CCTATACCCT CAGCCCAACT GCCTACACCC TACTGGACTT 1080 

CGTGGATGGA ATGCAGTTCT GCAGCAGTGG CTTTCAAGGA CTTGACATCC ACCCTCCAGC 1140 

TGGGCCCCTC TGGATCCTGG GGGATGTCTT CATTCGACAG TTTTACTCAG TCTTTGACCG 1200 

TGGGAATAAC CGTGTGGGAC TGGCCCCAGC AGTCCCCTAA GGAGGGGCCT TGTGTCTGTG 1260 

CCTCCCTGTC TGACAGACCT TGAATATGTT AGGCTGGGGC ATTCTTTACA CCTACAAAAA 1320 

GTTATTTTCC AGAGAATGTA GCTGTTTCCA GGGTTGCAAC TTGAATTAAG ACCAAACAGA 1380 

ACATGAGAAT ACACACACAC ACACACATAT ACACACACAC ACACTTCACA CATACACACC 1440 

ACTCCCACCA CCGTCATGAT GGAGGAATTA CGTTATACAT TCA1ATTTTG TATTGATTTT 1500 

TGATTATGAA AATCAAAAAT TTTCACATTT GATTATGAAA ATCTCCAAAC ATATGCACAA 1560 

GCAGAGATCA TGGTATAATA AATCCCTTTG CAACTCCACT CAGCCCTGAC AACCCATCCA 1620 

CACACGGCCA GGCCTGTTTA TCTACACTGC TGCCCACTCC TCTCTCCAGC TCCACATGCT 16 BO 

GTACCTGGAT CATTCTGAAG CAAATTCCGA GCATTACATC ATTTTGTCCA TAAATATTTC 1740 

TAACATCCTT AAATATACAA TCGGAATTCA AGCATCTCCC ATTGTCCCAC AAATGTTTGG 1800 

CTGTTTTTGT AGTTGGATTG TTTGTATTAG GATTCAAGCA AGGCCCATAT ATTGCATTTA 1860 

TTTGAAATGT CTGTAAGTCT CTTTCCATCT ACAGAGTTTA GCACATTTGA ACGTTGCTGG 1920 

TTGAAATCCC GAGGTGTCAT TTGACATGGT TCTCTGAACT TATCTTTCCT ATAAAATGGT 1980 

AGTTAGATCT GGAGGTCTGA TTTTGTGGCA AAAATACTTC CTAGGTGGTG CTGGGTACTT 2040 

CTTGTTGCAT CCTGTCAGGA GGCAGATAAT GCTGGTGCCT CTCTATTGGT AATGTTAAGA 2100 
CTGCTGGGTG GGTTTGGAGT TCTTGGCTTT AATCATTCAT TACAAAGTTC AGCATTTT 

Seq ID NO: 54 Protein Sequence 
Protein Accession 8: NP_001901.1 



60 
120 
180 
240 



1 11 21 31 41 51 

MKTLLLLLLV LLELGEAQGS LHRVPLRRHP SLKKKLRARS QLSBFWKSHN LDMIQFTESC 
SMDQSAKEPL INYLDMEYFG TISIGSPPQN FTVIFDTGSS NLWVPSVYCT SPACKTHSRP 
QPSQSSTYSQ PGQSPSIQYG TGSLSGIIGA DQVSVEGLTV VGQQFGESVT BPGQTPVDAE 
FDGILGLGYP SLAVGGVTPV FDNMMAQNLV DLPMFSVYMS SNPEGGAGSE LIFGGYDHSH 
FSGSLNWVPV TKQAYWQIAL DNIQVGGTVM FCSEGCQAIV DTGTSLITGP SDKIKQLQNA 300 
IGAAPVDGEY AVECANLNVM PDVTPTINGV PYTLSPTAYT LLDFVDGMQF CSSGPQGLDI 360 
HPPAGPLWIL GDVFIROFYS VFDRGNNRVG LAPAVP 

Seq ID NO: 55 DNA sequence 

Nucleic Acid Accession &: NM_018058.1 

Coding sequence: 319. .1575 

1 11 f i 1 r r 

TACGCGCTGC GGGACCGGCA GGGGAACGCC ATCGGGGTCA CAGCCTGCGA CATCGACGGG 
GACGGCCGGG AGGAGATCTA CTTCCTCAAC ACCAATAATG CCTTCTCGGG GGTGGCCACG 
TACACCGACA AGTTGTTCAA GTTCCGCAAT AACCGGTGGG AAGACATCCT GAGCGATGAG 
rTrAACGTGG CCCGTGGTGT GGCCAGCCTC TTTGCCGGAC GCTCTGTGGC CTGTGTGGAC 240 
AGAAAGGGCT CTGGACGCTA CTCTATCTAC ATTGCCAATT ACGCCTACGG TAATGTGGGC - 300 
CCTGATGCCC TCATTGAAAT GGACCCTGAG GCCAGTGACC TCTCCCGGGG CATTCTGGCG 3 60 
CTCAGAGATG TGGCTGCTGA GGCTGGGGTC AGCAAATATA CAGGGGGCCG AGGCGTCAGC 
GTGGGCCCCA TCCTCAGCAG CAGTGCCTCG GATATCTTCT GCGACAATGA GAATGGGCCT 
AACTTCCTTT TCCACAACCG GGGCGATGGC ACCTTTGTCG ACGCTGCGGC CAGTGCTGGT 
GTGGACGACC CCCACCAGCA TGGGCGAGGT GTCGCCCTGG CTGACTTCAA CCGTGATGGC 
AAAGTGGACA TCGTCTATGG CAACTGGAAT GGCCCCCACC GCCTCTATCT GCAAATGAGC 

247 



60 

120 

180 



360 
420 
480 
540 
600 
660 
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PCT/US02/21338 



ACCCATGGGA AGGTCCGCTT CCGGGACATC 
GTCCGCACGG TCATCACCGC CGACTTTGAC 
AACATTGCCT ACCGCAGCTC CTCAGCCAAC 
GGAGACCCCC TCATCGAGGA GCTCAATCCC 
5 ACAGGGGGTG TGGTGACCGA CTTCGACGGA 
GGAGAGTCCA TGGCTCAGCC GCTGTCCGTC 
TGGCTGCGAG TGGTGCCACG CACCCGGGTT 
CTCTACACCA AGAAGAGTGG GGCCCACCTG 
TGTGAGATGG AGCCCGTGGC ACACTTTGGC 

10 GTGACGTGGC CAGATGGCAA GATGGTGAGC 
GTGCTGGAGA TCCTCTACCC CCGGGATGAG 
ACACCAATGA ATGCATCCAG TTCCCATTCG 
ACACCTATGG AAGCTACAGG TGCCGGACCA 
ACGAGGATGG CACAGCCTGC GTGGGGACTC 

15 CCCCCACCGC TGCTGCTGCC ACTGCCGCTG 
CACCGGTCCT CGTAGATGGA GATCTCAATC 
CCAGCTGCTG AGCAGGGGTG GGACATGAAC 
AAGTGGGCTT GTGCTGCTGC CTAGACAGTA 
CCCAAGCCCA TCCATGCACA TTACTTAGCT 

20 CTGTGCTGGG CACATAGCTG TGATCACAGC 
ATTCCAGTGG GTCTAATGAC CATATCTTAG 
CTGCACAGGA AGTATGAGGA CTTTAGTGTC 
AAAGCTATGT GACCTTACAC CAGTCACTTA 
AAATGGGGAT TAAGAATAGA ATCTTGGGGT 

25 GACACTTGGC ACAAAACCTG GCACATAGTA 
GGGCTTTGTC AACACGTG 



GCCTCACCCA AGTTCTCCAT GCCCTCCCCT 720 

AATGACCAGG AGCTGGAGAT CTTCTTCAAC 780 

CGCCTCTTCC GOGTCATCCG TAGAGAGCAC 840 

GGCGACGCCT TGGAGCCTGA GGGCCGGGGC 900 

GACGGGATGC TGGACCTCAT CTTGTCCCAT 960 

TTCCGGGGCA ATCAGGGCTT CAACAACAAC 1020 

GGGGCCTTTG CCAGGGGAGC TAAGGTCGTG 1080 

AGGATCATCG ACGGGGGCTC AGGCTACCTG 1140 

CTGGGGAAGG ATGAAGCCAG CAGTGTGGAG 1200 

CGGAACGTGG CCAGCGGGGA GATGAACTCA 1260 

GACACACTTC AGGACCCAGC CCCACTGGAG 1320 

TGTGCCCTCG AGACAAGCCC GTATGTGTCA 1380 

ACAAGAAGTG CAGTCGGGGC TACGAGCCCA 1440 

TCGGCCAGTC ACCGGGCCCC CGCCCCACCA 1500 

CTGCTGCCGC TGCTGGAGCT GCCACTGCTG 1560 

TGGGGTCGGT GGTTAAGGAG AGCTGCGAGC 1620 

CAGCGGATGG AGTCCAGCAG GGGAGTGGGA 1680 

GGGATGTAAA GGCCTGGGAG CTAGACCCTC 1740 

AACAATTAGG GAGACTCGTA AGGCCAGGCC 1800 

AGACAGGGTC GCTGCCCTGA TGGCGCTTAC 1860 

GACACAGATG TGCCCAGGGA GGTGGTGTCA 1920 

CTGAGTTCAA ATCCTGATTC AGGAACTCAC 1980 

ACTTGTTAGC CATCCATTAT CGCATCTGCA 2040 

TAGTGTGGAG ATTAGATTAA ATGTATGTAA 2100 

AAGGCTCAAT AAAAACAAGT GCCTCTCACT 2160 



30 



40 



Seq ID NO: 56 Protein Sequence 
Protein Accession ft: NP_060528.1 



1 11 21 31 41 51 

MDPEASDLSR GILALRDVAA EAGVSKYTGG RGVSVGPILS SSASDIFCDN ENGPNFLPHN 60 

RGDGTFVDAA ASAGVDDPHQ HGRGVALADF NRDGKVDIVY GNWNGPHRLY LQMSTHGKVR 120 

35 FRDIASPKFS MPSPVRTVIT ADFDNDQELE IFFNNIAYRS SSANRLFRVI RREHGDPLIE 180 

EUJPGDALEP EGRGTGGWT DFDGDGMLDL ILSHGESMAQ PLSVFRGNQG FNNNWLRWP 240 

RTRVGAFARG AKWLYTKKS GAHLRIIDGG SGYLCEMEPV AHFGLGKDEA SSVEVTWPDG 300 

KMVSRNVASG EMNSVLEILY PRDEDTLQDP APLETPMNAS SSHSCALETS PYVSTPMEAT 360 
GAGPTRSAVG ATSPTRMAQP AWGLSASHRA PAPPPPPLLL PLPLLLPLLE LPLLHRSS 



Seq ID NO: 57 DNA sequence 

Nucleic Acid Accession ft: AJ279016.1 

Coding sequence: 1..1962 



45 1 11 21 31 |1 51 

ATOTCCAGGA TGTTACCGTT CCTGCTGCTG CTCTGGTTTC TGCCCATCAC TGAGGGGTCC 60 

CAGCGGGCTG AACCCATGTT CACTGCAGTC ACCAACTCAG TTCTGCCTCC TGACTATGAC 120 

AGTAATCCCA CCCAGCTCAA CTATGGTGTG GCAGTTACTG ATGTGGACCA TGATGGGGAC 180 

50 TTTGAGATCG TCGTGGCGGG GTACAATGGA CCCAACCTGG TTCTGAAGTA TGACCGGGCC 240 

CAGAAGCGGC TGGTGAACAT CGCGGTCGAT GAGCGCAGCT CACCCTACTA CGCGCTGCGG 300 

GACCGGCAGG GGAACGCCAT CGGGGTCACA GCCTGCGACA TCGACGGGGA CGGCCGGGAG 360 

GAGATCTACT TCCTCAACAC CAATAATGCC TTCTCGGGGG TGGCCACGTA CACCGACAAG 420 

TTGTTCAAGT TCCGCAATAA CCGGTGGGAA GACATCCTGA GCGATGAGGT CAACGTGGCC 480 

55 CGTGGTGTGG CCAGCCTCTT TGCCGGACGC TCTGTGGCCT GTGTGGACAG AAAGGGCTCT 540 

GGACGCTACT CTATCTACAT TGCCAATTAC GCCTACGGTA ATGTGGGCCC TGATGCCCTC 600 

ATTGAAATGG ACCCTGAGGC CAGTGACCTC TCCCGGGGCA TTCTGGCGCT CAGAGATGTG 660 

GCTGCTGAGG CTGGGGTCAG CAAATATACA GGGGGCCGAG GOGTCAGCGT GGGCCCCATC 720 

CTCAGCAGCA GTGCCTCGGA TATCTTCTGC GACAATGAGA ATGGGCCTAA CTTCCTTTTC 780 

60 CACAACCGGG GCGATGGCAC CTTTGTGGAC GCTGCGGCCA GTGCTGGTGT GGAOGACCOC 840 

CACCAGCATG GGCGAGGTGT CGCCCTGGCT GACTTCAACC GTGATGGCAA AGTGGACATC 900 

GTCTATGGCA ACTGGAATGG CCCCCACCGC CTCTATCTGC AAATGAGCAC ^TGGGAAG 960 

GTCCGCTTCC GGGACATCGC CTCACCCAAG TTCTCCATGC CCTCCCCTGT CCGCACGGTC 1020 

ATCACCGCCG ACTTTGACAA TGACCAGGAG CTGGAGATCT TCTTCAACAA CATTCCCTAC 1080 

65 CGCAGCTCCT CAGCCAACCG CCTCTTCCGC GTCATCCGTA GAGAGCACGG AGACCCCCTC 1140 

ATCGAGGAGC TCAATCCCGG CGACGCCTTG GAGCCTGAGG GCCGGGGCAC AGGGGGTGTG 1200 

GTGACCGACT TCGACGGAGA CGGGATGCTG GACCTCATCT TGTCCCATGG AGAGTCCATG 1260 

GCTCAGCCGC TGTCCGTCTT CCGGGGCAAT CAGGGCTTCA ACAACAACTG GCTGCGAGTG 1320 

GTGCCACGCA CCCGGTTTGG GGCCTTTGCC AGGGGAGCTA AGGTCGTGCT CTACACCAAG 1380 

70 AAGAGTGGGG CCCACCTGAG GATCATCGAC GGGGGCTCAG GCTACCTGTG TOAGATGGAG 1440 

CCCGTGGCAC ACTTTGGCCT GGGGAAGGAT GAAGCCAGCA GTGTGGAGGT GACGTGGCCA 1500 

GATGGCAAGA TGGTGAGCCG GAACGTGGCC AGCGGGGAGA TGAACTCAGT GC^AGATC 1560 

CTCTACCCCC GGGATGAGGA CACACTTCAG GACCCAGCCC CACTGGAGTG TGGCCAAGGA 1620 

TTCTCCCAGC AGGAAAATGG CCATTGCATG GACACCAATG AATGCATCCA GTTCCCATTC 1680 

75 GTGTGCCCTC GAGACAAGCC CGTATGTGTC AACACCTATG GAAGCTACAG GTGCCGGACC 1740 

AACAAGAAGT GCAGTCGGGG CTACGAGCCC AACGAGGATG GCACAGCCTG CGTGGGGACT 1800 

CTCGGCCAGT CACCGGGCCC CCGCCCCACC ACCCCCACCG CTGCTGCTGC CACTGCCGCT I860 

GCTGCTGCCG CTGCTGGAGC TGCCACTGCT GCACCGGTCC TCGTAGATGG AGATCTCAAT 1920 
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5 

10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 



CTGGGGTCGG 
CCAGCGGATG 
AGGGATGTAA 
TAACAATTAG 
CAGACAGGGT 
GGACACAGAT 
CCTGAGTTCA 
AACTTGTTAG 
TTAGTGTGGA 
AAAGGCTCAA 



TGGTTAAGGA 
GAGTCCAGCA 
AGGCCTGGGA 
GGAGACTCGT 
CGCTGCCCTG 
GTGCCCAGGG 
AATCCTGATT 
CCATCCATTA 
GATTAGATTA 
TAAAAACAAG 



GAGCTGCGAG 
GGGGAGTGGG 
GCTAGACCCT 
AAGGCCAGGC 
ATGGCGCTTA 
AGGTGGTGTC 
CAGGAACTCA 
TCGCATCTGC 
AATGTATGTA 
TGCCTCTCAC 



CCCAGCTGCT 
AAAGTGGGCT 
CCCCAAGCCC 
CCTGTGCTGG 
CATTCCAGTG 
ACTGCACAGG 
CAAAGCTATG 
AAAATGGGGA 
AGACACTTGG 
TGGGCTTTGT 



GAGCAGGGGT 
TGTGCTGCTG 
ATCCATGCAC 
GCACATAGCT 
GGTCTAATGA 
AAGTATGAGG 
TGACCTTACA 
TTAAGAATAG 
CACAAAACCT 
CAACACG 



GGGACATGAA 
CCTAGACAGT 
ATTACTTAGC 
GTGATCACAG 
CCATATCTTA 
ACTTTAGTGT 
CCAGTCACTT 
AATCTTGGGG 
GGCACATAGT 



1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 



Seq ID NO: 58 Protein Sequence 
Protein Accession 8: CAC08451.1 



1 
I 

MSRMLPFLLL 
FEIWAGYNG 
EIYPLNTNNA 
GRYSIYIANY 
LSSSASDIFC 
VYGNWNGPHR 
RSSSANRL.FR 
AQPLSVFRGN 
PVAHFGLGKD 



LGQSPGPRPT 



11 

I 

LWFIiPITEGS 
PNLVLKYDRA 
FSGVATYTDK 
AYGNVGPDAL 
DNENGPNPLF 
LYLQMSTHGK 
VIRREHGDPL 
QGFNNNWLRV 
EASSVEVTWP 
DTNECIQFPF 
TPTAAAATAA 



21 
I 

QRAEPMFTAV 
QKRLVNIAVD 
LFKPRNNRWE 
IEMDPEASDL 
HNRGDGTFVD 
VRFRDIASPK 
IEELNPGDAL 
VPRTRFGAFA 
DGKMVSRNVA 
VCPRDKPVCV 
AAAAAGAATA 



31 
I 

TNSVLPPDYD 
ERSSPYYALR 
DILSDEVNVA 
SRGILALRDV 
AAASAGVDDP 
FSMPSPVRTV 
EPEGRGTGGV 
RGAKWLYTK 
SGEMNSVLEI 
NTYGSYRCRT 
APVLVDGDLN 



41 
I 

SNPTQLNYGV 
DRQGNAIGVT 
RGVASLFAGR 
AAEAGVSKYT 
HQHGRGVALA 
ITADFDNDQE 
VTDFDGDGML 
KSGAHLRI ID 
LYPRDEDTLQ 
NKKCSRGYEP 
LGSWKESCE 



51 

I 

AVTDVDHDGD 
ACDIDGDGRE 
SVACVDRKGS 
GGRGVSVGPI 
DFNRDGKVDI 
LEIPPNNIAY 
DLILSHGESM 
GGSGYLCEME 
DPAPLECGQG 
NEDGTACVGT 
PSC 



Seq ID NOs 59 DMA sequence 
Nucleic Acid Accession #: FGENESH 
Coding sequence: 1..4794 



I 



ATGGCGTGTC 
AGCGGCTCCT 
GTTCTGAAGT 
TCACCCTACT 
ATCGACGGGG 
CACAGCAGCT 
CCACCTACAA 
TCCTCCCTGG 
TGTCGGGGTG 
GGGGTGGCCA 
CTGAGCGATG 
GCCTGTGTGG 
GGTAATGTGG 
GGCATTCTGG 
TTCTCCCACA 
GGAGGAGACC 
TGCCGGCTGG 
CAGAGGGAGG 
TCCAAAAGCC 
GCGCCTTCTC 
CCCCTTGTCA 
CCCCACCCCC 
CTGATGGCTG 
CTGAGAAGCT 
GAGCTGGGAG 
CTGGGAGAAC 
CCCAAGGTCA 
GGCCCCGGGA 
CTCTCCCATC 
GTGCCGGGAG 
CTGGCGTGGA 
TTTAGGCTCA 
CTGCAGTTCC 
TCTGCCACTC 
ATCCTCAGCA 
TTCCACAACC 
GCCTTCATCG 
CTAGCAGAAA 
CCACATTGCC 
TTCTTGACGC 
CAGGGGGCCC 
ACTGCCTATT 
TTGTCCTCTG 
GCCCTGGCTG 



11 
I 

CGGGAGGACT 
CCCCAGCATC 
ATGACCGGGC 
ACGCGCTGCG 
ACGGCCGGGA 
CAGCGCAGGT 
CCCCTGCAGG 
GTCAGGCTTC 
GACTGAGACC 
CGTACACCGA 
AGGTCAACGT 
ACAGAAAGGG 
GCCCTGATGC 
CGCTCAGAGA 
CTGCCTCTCC 
CAGAGGAGGC 
GCTGGAAGGA 
CTGGGGCAGC 
ATTTGGCTGA 
CAGCCCACCC 
CTCAGCTAAT 
GAGCCCCAGG 
AGGCTTTGGG 
GGGAGGAAAG 
GTCCCTGGAG 
CTCCCATTTT 
CACAGGAGTG 
GGGTGGCCAA 
CCCTGGTCCC 
CTGCCCTGCC 
ACCAGATGGA 
GGAAAGCACG 
CCTCAGGCCT 
ACTGTGGGTC 
GCAGTGCCTC 
GGGGCGATGG 
TTCACCTCAA 
CTGGTCCTTC 
ATCATGGTTT 
AAGGCTTGGC 
CACCCTGCCT 
ACATTGTCCT 
AAAGAGTCAA 
ACTTCAACCG 



21 
I 

CCCAGCCCGT 
CCCTCCCCAT 
CCAGAAGCGG 
GGACCGGCAG 
GGAGATCTAC 
CCCTTCTGGG 
CCTCCTGGGT 
TCCGGACAGC 
TACCCATGAA 
CAAGTTGTTC 
GGCCCGTGGT 
CTCTGGACGC 
CCTCATTGAA 
TGTGGCTGCT 
AAGCATTGGT 
AGATGAGGAG 
CGGGCAGTTC 
TGGCGTGCCC 
CAAGAACCTA 
TTTCCCTGCC 
GACACATGGA 
AATGGACCCC 
CGCGTGGCCA 
CAGGCAGAAG 
CCAAGCCACA 
ACAAAGAACA 
CCATCTAGTG 
GCGAGAGATT 
CAACTTCCCC 
TGGGAATCCT 
AAAAGAGGAG 
GGAAGCAGAA 
CAGAGGCAGC 
GATGTCTTTT 
GGATATCTTC 
CACCTTTGTG 
ATATCACCTC 
CTCCTCCTGC 
GTCTATGAGC 
CTCCAGTGCC 
TCTGGCAAGA 
GTGGTCTGCC 
CGTGGGTGTG 
TGATGGCAAA 



31 
I 

TGCTCTGGTT 
TCCTCCTCCA 
CTGGTGAACA 
GGGAACGCCA 
TTCCTCAACA 
CTCCACAGAA 
CTGCCTCCAC 
AGGCAGGGAG 
CCAGAACCAT 
AAGTTCCGCA 
GTGGCCAGCC 
TACTCTATCT 
ATGGACCCTG 
GAGGCTGGGG 
GAGATATCTG 
CACAGTGGGG 
AAGGAAGAAG 
AGAGGACGTG 
TTTGGCCCAC 
CGCCAAGCCC 
CGTCTGGCTG 
AAATGTAAGG 
GCGCTCAGCA 
GGGCAGGCCA 
CAGCACCTGC 
GACGGAGATC 
GCCACCATGC 
GGGAGAGAGA 
AGCTGCTTGA 
GGGAACTGGG 
GGGAAGATTC 
TTCCCCCCAG 
CCTGTCCTCC 
CTAGGGGGCC 
TGCGACAATG 
GACGCTGCGG 
TGCAGAGATT 
TGCCCGTGGC 
TTTACAAGGA 
CACCGGAGGA 
GCTCCCTGTG 
ATCCCAGAGA 
GACGACCCCC 
GTGGACATCG 



41 



GGATGGGACT 
GGTACAATGG 
TCGCGGTCGA 
TCGGGGTCAC 
CCAATAATGC 
ACAGGCCTGT 
TCAGCGGAAG 
AGAGGGTGCC 
TTCTTCTGAG 
ATAACCGGTG 
TCTTTGCCGG 
ACATTGCCAA 
AGGCCAGTGA 
TCAGCAAATA 
GCAGAACCGA 
ATGGAAGCAC 
CAGCAGCTTT 
TTCGAACAGC 
CATGTTACTA 
CCCAACACTA 
GAAAACTAGC 
GCCGCCATGC 
CCACTGTGGT 
TGTCCAGATG 
CTGCTAGAGA 
CAGGGAGGAG 
CAGCTCTCGG 
CTGGGGCAGT 
GGCCTCTTGA 
TTCTGGACAT 
ATGGAGACCA 
GCTCCTCTGA 
AGGTGGGCCT 
GAGGCGTCAG 
AGAATGGGCC 
CCAGTGCTGA 
TTCCTCACTC 
ATGCACGTCT 
CCGGGTCACG 
CACTCAGCCT 
TCCTGGGGTC 
GCCTGATGAC 
ACCAGCATGG 
TCTATGGCAA 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 



51 
I 

GGGTGGGCCC 
ACCCAACCTG 
TGAGCGCAGC 
AGCCTGCGAC 
CTTCTCGGGC 
GCTGAAGCCT 
GGACTTTTCC 
GGTTCCCTGC 
ACCCAAATCA 
GGAAGACATC 
ACGCTCTGTG 
TTACGCCTAC 
CCTCTCCCGG 
TACAGAAGGC 
GGAGCGGGAA 
CAGCCAACTG 
GGTGGAGGAA 
TCTGCAGACT 
TTCTGTCTGC 
CCCTGTAGCC 
CCGGAGTGTC 
TGAGCCCGGC 
GCCAGGGGGC 
TGCACTCAGG 
GCTGTATGAC 
AAGGGACTCG 
GGGACTCGAG 
AGGAAGACCA 
AGCCGGGACA 
GGCCAAGGCC 
TGAGCCCAGA 
GGAGCCTCTG 
GGGGCTTGCT 
CGTGGGCCCC 
TAACTTCCTT 
ACGTCGTTTA 
CCTGTGCCAC 
TCTTCAGGCT 
GTTCTATTCA 
CCAGGGTTCT 
TCTGATCCCC 
CCACAGCTAT 
GCGAGGTGTC 
CTGGAATGGC 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 
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WO 03/003906 



CCCCACCGCC TCTATCTGCA AATGAGCACC CATGGGAAGG TCCGCTTCCG GGACATCGCC 2700 

TCACCCAAGT TCTCCATGCC CTCCCCTGTC CGCACGGTCA TCACCGCCGA CTTTGACAAT 2760 

GACCAGGAGC TGGAGATCTT CTTCAACAAC ATTGCCTACC GCAGCTCCTC AGCCAACCGC 2820 

CTCTTCCGAT GCTCCATCCT GGCTCGTGGC TCTTCATCCT TGACAGCTGG TGGGAGGAAC 2880 

GGTCAGGGAG AAGGTTTAAG AATCAGAAGG GGAGGGTTCC CAGGGCCAGG GGGTCAGGCC 2940 

AAGGTCAACA CAGGTCCCCT GATGAAGAAA CAGAAAGGAA GGAAGGACGA GGACTGGGCA 3000 

AGAGGCTGTG GGAATGCAGG GCAAAGCCTG GCCAAGGAGC CGGCCTCTGC TATTGCAGGG 3060 

AAAGGGAAGG GAAATGTGGC CCAAAGTGTG CCCAGAACCC AAGCGCCACA AGATACAAAG ,3120 

CCACACTACC ACAAAAAGGG GCTACAGGGT CCAATCACTA CCAGGAAAAG GGGCTACGGG 3180 

GTCCAATCAC TACCAGGAAA AGGGGCTACG GGGTCCAATC ACTACCAGGA AAAGGGGCTA 3240 

CGGGGTCCAA TCACTACCAG GAAAAGGGGC TACGGGGTCC AATCACTACC AGGAAAAGGG 3300 

GCTACGGGCT CCAATCACTA CCAGGAAAAG GGGCTACAGG GTCCAATCAC TACCAGGAAA 3360 

AGGGGCTACG GGCTCCAATC ACTACCAGGA AAAGGGGCTA CAGGGTCCAA TCACTACCAC 3420 

AGAAAGGGGC TACGGGCTCC AATCACTACC AGGAAAAGGG GCTACGGGGT CCAATCACTA 3480 

CCAGGAAAAG GGGCTACAGG GTCCAATCAC TACCAGGAAA AGGGGCTACG GGGTCCAATC 3540 

ACTACCAGGA AAAGGGGCTA CGGGCTCCAA TCACTACCAG GAAAAGGGGC TACGGGGTCC 3600 

AATCACTACC AGGAAAAGGG GCTACAGGGT CCAATCACTA CCAGGAAAAG GGGCTACAGG 3660 

GTCCAATCAC TACCACAGAA AGGGGCTACG GGCTCCAATC ACTACCAGGA AAAGGGGCTA 3720 

CGGGGTCCAA TCACTACCAG GAAAAGGGGC TACGGGCTCC AATCACTACC AGGAAAAGAG 3780 

GCTATGGGGT CCAATCACTA CCAGGAAAAG GGGCTACGGG CTCCAATCAC TACCAGGAAA 3840 

AGGGGCTATG GGGTCCAATC ACTACCACAG AAAGGGGCTA CGGGGTCCAA CGTCATCCGT 3900 

AGAGAGCACG GAGACCCCCT CATCGAGGAG CTCAATCCCG GCGACGCCTT GGAGCCTGAG 3960 

GGCCGGGGCA CAGGGGGTGT GGTGACCGAC TTCGACGGAG ACGGGATGCT GGACCTCATC 4020 

TTGTCCCATG GAGAGTCCAT GGCTCAGCCG CTGTCCGTCT TCCGGGGCAA TCAGGGCTTC 4080 

AACAACAACT GGCTGCGAGT GGTGCCACGC ACCCGGTTTG GGGCCTTTGC CAGGGGAGCT 4140 

AAGGTCGTGC TCTACACCAA GAAGAGTGGG GCCCACCTGA GGATCATCGA CGGGGGCTCA 4200 

GGCTACCTGT GTGAGATGGA GCCCGTGGCA CACTTTGGCC TGGGGAAGGA TGAAGCCAGC 4260 

AGTGTGGAGG TGACGTGGCC AGATGGCAAG ATGGTGAGCC GGAACGTGGC CAGCGGGGAG 432 0 

ATGAACTCAG TGCTGGAGAT CCTCTACCCC CGGGATGAGG ACACACTTCA GGACCCAGCC 4380 

CCACTGGAGT GTGGCCAAGG ATTCTCCCAG CAGGAAAATG GCCATTGCAT GGACACCAAT 4440 

GAATGCATCC AGTTCCCATT CGTGTGCCCT CGAGACAAGC CCGTATGTGT CAACACCTAT 4500 

GGAAGCTACA GGTGCCGGAC CAACAAGAAG TGCAGTCGGG GCTACGAGCC CAACGAGGAT 4560 

GGCACAGCCT GCGTGGGTAC TGAGCTAGGC TCTAGGCATA CAATGACGTG GAAA CCAAGG 4620 

CCCAAAAAGG AGCTGCAACT TTCCCAAGGC ATCTGCACCC CCGTCTGGTC CTTTTTCCTG 4680 

CCGGGTTGCC GGCTGCTCCT CAAAAGAGCT CAGCTCCAGG CTGCTCCCAG CACCCTTCTC 4740 
CAGAAAGCTC CAGGTATTCC AGAAGCCCAA GTGTATGAAC AAGATCAGGA ATAA 



Seq ID NO: 60 Protein Sequence 
Protein Accession 8: FGENESH 



1 
I 

MACPGGLPAR 
SPYYALRDRQ 
PPTTPAG 1*1/3 
GVATYTDKL.F 
GNVGPDALIE 
GGDPEBADEE 
SKSHttADKNIi 
PHPRAPGMDP 
BLGGPWSQAT 
GPGRVAKREI 
LAWNQMEKEE 
SATHCGSMSF 
AFIVHLKYHL 
FLTQGLASSA 
LSSERVNVGV 
SPKFSMPSPV 
GQGEGIiRIRR 
KGKGNVAQSV 
RGPITTRKRG 
RKGLRAPITT 
NHYQEKGLQG 
AMGSNHYQEK 
GRGTGGWTD 
KWLYTKKSG 
MNSVLEILYP 
GSYRCRTNKK 
PGCRLLLKRA 



11 
I 

CSGWMGUSGP 
GNAIGVTACD 
LPPLSGRDFS 
KFRNNRWEDI 
MDPEASDLSR 
HSGDGSTSQL 
FGPPCYYSVC 
KCKGRHAEPG 
QHLPARBLYD 
GRETGAVGRP 



LGGRGVSVGP 
CRDPPHSLfCH 
HRRTLSLQGS 
DDPHQHGRGV 
RTVITADFDN 
GGPPGPGGQA 
PRTQAPQDTK 
YGVQSLPGKG 
RKRGYGVQSL 
PITTRKRGYR 
GLRAPITTRK 
FDGDGMLDLI 
AHLRIIDGGS 
RDEDTLQDPA 
CSRGYEPNED 
QLQAAPSTLL 



21 
I 

SGSSPASPPH 
IDGDGREEIY 
SSLGQASPDS 
LSDEVNVARG 
GILALRDVAA 
CRLGWKDGQF 
APSPAHPFPA 
LMAEALGAWP 
LGEPPILQRT 
LSHPLVPNFP 
FRURKAREAE 
ILSSSASDIF 
LAETGPSSSC 
QGAPPCltLAR 
ALADFNRDGK 
DQELEIFFNN 
KVNTGPLMKK 
PHYHKKGLQG 
ATGSNHYQEK 
PGKGATGSNH 
VQSLPQKGAT 
RGYGVQSLPQ 
LSHGESMAQP 
GYLCEMEPVA 
PLECGQGFSQ 
GTACVGTELG 
QKAPGIPEAQ 



31 

I 

SSSRYNGPNL 
FLNTNNAFSG 
RCJGERVPVPC 
VASLFAGRSV 
BAGVSKYTEG 
KEEAAALVEE 
RQAPQHYPVA 
ALSTTWPGG 
DGDPGRRRDS 
SCLRPLEAGT 
FPPGSSEEPL 
CDNENGPNFL 
CPWHARXjLQA 
APCVLGSLIP 
VDIVYGNWNG 
IAYRSSSANR 
QKGRKDEDWA 
PITTRKRGYG 
GLQGPITTRK 
YQEKG1»RGPI 
GSNHYQEKGL 
KGATGSNVIR 
LiSVFRGNQGF 
HFGLGKDEAS 
QENGHCMDTN 
SRHTMTWKPR 
VYEQDQE 



41 

1 

VLKYDRAQKR 
HSSSAQVPSG 
CRGGLRPTHE 
ACVDRKGSGR 
FSHTASPSIG 
QREAGAAGVP 
PLVTQLMTHG 
LRSWEESRQK 
PKVTQECHLV 
VPGAALPGNP 
LQFPSGLRGS 
FHNRGDGTFV 
PHCHHGLSMS 
TAYYIVLWSA 
PHRLYU2MST 
LFRCSILARG 
RGCGNAGQSL 
VQSLPGKGAT 
RGYGLQSLPG 
TTRKRGYGLQ 
RGPITTRKRG 
REHGDPLIEE 
1WNWLRWPR 
SVEVTWPDGK 
ECIQPPFVCP 
PKKELQLSQG 



51 
I 

LVNIAVDERS 
LHRNRPVLKP 
PEPFLLRPKS 
YSIYIANYAY 
EISGRTEBRB 
RGRVRTALQT 
RLAGKLARSV 
GQAMSRCALR 
ATMPALGGLE 
GNWVLDMAKA 
PVLQVGLGLA 
DAAASAERRL 
FTRTGSRFYS 
IPESLMTHSY 
HGKVRFRDIA 
SSSLTAGGRN 
AKBPASAIAG 
GSNHYQEKGL 
KGATGSNHYH 
SLPGKGATGS 
YGLQSIiPGKE 
LNPGDALEPE 
TRFGAFARGA 
MVSRNVASGE 
RDKPVCVNTY 
ICTPVWSFFL 



Seq ID NO: 61 DNA sequence 

Nucleic Acid Accession 8: NM_000584.1 

Coding sequencej 75.. 374 

1 U 21 31 41 51 

AGCAGAGCAC ACAAGCTTCT AGGACAAGAO CCAGGAAGAA ACCACCGGAA GGAACCATCT 

250 
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60 



WO 03/003906 



PCT/US02/21338 



5 

10 
15 
20 
25 



CACTGTGTGT 
CTGCAGCTCT 
GCATAAAGAC 
AGAGTGGACC 
TCTGTCTGGA 
CTGAGAATTC 
GAAACTTCAA 
CCAGATGCAA 
CATTGTACCA 
CAAAAAACAA 
TACAAATAGC 
GAATGGGTTT 
TTTGCCATAA 
CCTAGTCTGC 
TTTCTAAGTG 
TGGAAGCACT 
CTATTTATTA 
AGAAGATGAA 
GATATTAAAT 
ACAATTGGGT 
AGTACATTAT 
AGTCTTGTCA 
AACTATTAAA 
TGTTTATTAT 
AAGGCTTTAT 
TTCTGATTGT 
AAGTAAAAAA 



30 

35 

40 

45 

50 

55 

60 

65 

70 

75 



AAACATGACT 
GTGTGAAGGT 
ATACTCCAAA 
ACACTGCGCC 
CCCCAAGGAA 
ATAAAAAAAT 
GCAAATCTAC 
TACAAGATTC 
TGAAATATCC 
CAAATAATTT 
AAAATTGAGC 
GCTAGAATGT 
AGTCAAATTT 
TAGCCAGGAT 
GAAAAAGTAT 
TTAAGTTTTT 
TTTATGTATT 
TCATTGATTG 
GATGTTTTAT 
ACCCAGTTAA 
TGTTTATCTG 
TTGCCAGCTG 
ACAGCCAAAA 
GTACAAATAG 
ATTTTTAACT 
ATGGAAATAT 
AAAAAAAAA 



TCCAAGCTGG 
GCAGTTTTGC 
CCTTTCCACC 
AACACAGAAA 
AACTGGGTGC 
TCATTCTCTG 
TTCAACACTT 
CTGGTTAAAT 
AGAACATACT 
TTAAATATAA 
CAAGGGCCAA 
GATATTTGAA 
AGCTGGAAAT 
CCACAAGTCC 
TAGCCACCAT 
TCATCATAAC 
TATTTAAGCA 
AATAGTTATA 
TAGATAAATT 
ATTTTCATTT 
AAAGTTTTAA 
TGTTGGTAGT 
CTCCACAGTC 
ATTCTTATAA 
TTAAGATGTT 
AAAAGTAAAT 



CCGTGGCTCT 
CAAGGAGTGC 
CCAAATTTAT 
TTATTGTAAA 
AGAGGGTTGT 
TGGTATCCAA 
CATGTATTGT 
TTGAATTTCA 
TATATGTAAA 
GGATTTTCCT 
GAGAATATCC 
GCATCACATA 
CCTGGATTTT 
TTGTTCCACT 
CTTACCTCAC 
ATAAATTATT 
TCAAATATTT 
AAGATGTTAT 
TCAATCAGGG 
CAGATAAACA 
TTGAACTAAC 
GCTGTGTTGA 
AATATTAGTA 
TATTATTTAA 
TTTATGTGCT 
ATGAAACATT 



CTTGGCAGCC 
TAAAGAACTT 
CAAAGAACTG 
GCTTTCTGAT 
GGAGAAGTTT 
GAATCAGTGA 
GTGGGTCTGT 
GTAAACAATG 
GTATTATTTA 
AGATATTGCA 
GAACTTTAAT 
AAAATGATGG 
TTTCTGTTAA 
GTGCCTTGGT 
AGTGATGTTG 
TTCAAGTGTA 
GTGCAAGAAT 
AGTAAATTTA 
TTTTTAGATT 
ACAAATAATT 
AATCCTAGTT 
ATTACGGAAT 
ATTTCTTGCT 
ATGACTGCAT 
CTCCAAATTT 
TAAAATATAA 



TTCCTGATTT 
AGATGTCAGT 
AGAGTGATTG 
GGAAGAGAGC 
TTGAAGAGGG 
AGATGCCAGT 
TGTAGGGTTG 
AATAGTTTTT 
TTTGAATCTA 
CGGGAGAATA 
TTCAGGAATT 
GACAATAAAT 
ATCTGGCAAC 
TTCTCCTTTA 
TGAGGACATG 
ACTTATTAAC 
TTGGAAAAAT 
TTTTATTTTA 
AAACAAAGAA 
TTTTAGTATA 
TGATACTCCC 
AATGAGTTAG 
GGTTGAAACT 
TTTTAAATAC 
TTTTTACTGT 
TTTGTTGTCA 



120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 
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1020 

1080 

1140 
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Seq ID NO: 62 Protein Sequence 
Protein Accession *h NP_000575.1 



41 



1 11 r r [ i 

MTSKLAVALL AAFLISAAI/C EGAVLPRSAK ELRCQCIKTY SKPPHPKFIK ELRVIESGPH 
CANTEI I VKL SDGRELCLDP KENWVQRWE KFLKRAENS 

Seq ID NO: 63 DNA sequence 

Nucleic Acid Accession 8: EOS sequence 

Coding sequence: 52-1146 



I 



11 



GTTACCCAGC 
CAAGTGGTCA 
GTTGGTGGAG 
TCCATGTGCT 
ACCAATGGAA 
TCAAGAAGGG 
TGTTGCCGTG 
ATTGTGCCAC 
GTTCGAACGA 
ACAACTGTTC 
GTTTCAACGA 
GTGACAACAA 
CCAGTAGCCA 
GGAGCAATAA 
GACACCGTGA 
GAACATAGTC 
GTCTTGGTGC 
GAGGTTCAAC 
GTTGAAAAGG 
GACTAAGACC 
ACATCAGCAC 
GGCATTCCAA 
GTCAACCTCA 
ATCATTGCAG 
ATAAATCCTG 
TTGGCCATCA 
GAGACTTCAT 
TTGCATTTCC 
AACAAAGTGA 
ATTAAGTACT 

tatggaacac 
ocaatttttt 
atggtgaaat 
cagcctcacg 

TTTTTTTAGT 
CAGAGGATCC 



ATTGTGAGTG 
TCTTAAGCCT 
AGGCAGGTCC 
GGAATAGAGG 
CCCACGTCAC 
ATGTCTCTTT 
TTGAGCACCG 
CCAAGGTCAC 
GCACCACTGT 
CAACAACAAT 
CAACGAGCGT 
CTGTCTCTAC 
CTTCACCATC 
GGAGAGAACC 
CAGAGTCTTC 
TACTGACGGC 
TTCTTGCTCT 
AACTAAGTGT 
AAGTCCAAGC 
CAGTGGTGCT 
ATCAGACGTC 
CATGTCAGTG 
TCATTAATGT 
TTTTCTCTCA 
TATACATCAA 
CAGCTGTCCA 
TTGGAAGCAT 
TATGTTTTCC 
GTCTAACTCA 
GGATCTCTGA 
ATAGACACCA 
TTTTTTTTTT 
CTCGGCTCAC 
AGTAGCTGGG 
AGAGACAGGG 
AGCCTCCTTG 



21 

1 

ACAGAGCCTG 
CATCCTACAT 
ATCTGTCACA 
CTCATGTTCT 
CTATCGGAAG 
GACCATAGAA 
TGGGTGGTTC 
GACTACTCCA 
TCCAACGACA 
GAGCATTCCA 
TCCAACGACA 
CTTTGTTCCT 
TTCACCTCAG 
CACCAGCTCA 
AGATGGCCTT 
CAATACCACT 
TTTGGGTGTC 
TTCATTTAGC 
AGAAGACAAT 
CTTTGAGAGT 
TTTTAGACCC 
ATACTGGGTA 
AGTCCTAATT 
AATATGAACA 
AGAGAATAGT 
GAAGAGGGGA 
TGTATGATTA 
AAAGGTTTCA 
CAGGTTTAGC 
ATTGGGGTAG 
GGGGAAGAAA 
TGAGATGGAG 
TGTAACCTCC 
ATTACAGGTG 
TTTCATCATG 
GCCTTCCAAA 



31 
I 

GATCTGAACG 
CTGGCAGATT 
CTACCCTGCC 
CTATTCACAT 
GACACACGCT 
AATACAGCTG 
AATGACATGA 
ATTGTCACAA 
ACGACTGTTC 
ACGACAACGA 
ACGAGCATTC 
CCAATGCCTT 
CCAGCAGAAA 
CCATTGTACT 
TGGAATAACA 
AAAGGAATCT 
ATCATTGCCA 
AGCCTTCAAA 
ATCTACATTG 
TTACGCCCAT 
CAAGACAATT 
GAGTAACTCT 
TTTTATGCTA 
CTTTATAATT 
CCCTGGAAAC 
ATCTGTCTTA 
TCTCTTGTTT 
AATCGTGGGT 
TGTTTTCTCA 
CTGTTTTACC 
ATCATTTGCC 
CTTTGCTCTT 
ACCTTCCGGG 
TCCGCCACTA 
TTGGCCAGAA 
GTGCTGGGAT 



41 



I 

CTGATCCCAT 
CTGTAGCTGG 
ACTACAGTGG 
GCCAAAATGG 
ATAAGCTATT 
TGTCTGACAG 
AAATCACCGT 
CTGTTCCAAC 
CAATGACGAC 
CTGTTCTGAC 
CAACAACAAC 
TGCCCAGGCA 
CCCACCCTAC 
CTTACACAAC 
ATCAAACTCA 
ATGCTGGAGT 
AAAAGTATTT 
TTAAAGCTTT 
AGAATAGTCT 
GAGTGCAGAA 
TTTCTGTTTC 
CTCACTCCAA 
AAACTGGCTC 
GTATGTTCTT 
ATAGCAAATG 
AAAACCAGCA 
CTATGTTATA 
TTTTATTTCC 
TAACTCTGGA 
AGTTAAAGAG 
AGGTGATTTA 
GTTGCCCAGG 
TTCAAGCAAT 
TGCTCAGCTA 
TGATCTCGAA 
TACAGGCATG 



60 



51 
I 

AATGCATCCT 
TTCTGTAAAG 
AGCTGTCACA 
CATTGTCTGG 
GGGGGACCTT 
TGGCGTATAT 
ATCATTGGAG 
CGTCACGACT 
TGTTCCAACG 
GACAATGACT 
AAGTGTTCCA 
GAACCATGAA 
GACACTGCAG 
AGATGGGAAT 
ACTGTTCCTA 
CTGTATTTCT 
CTTCAAAAAG 
GCAAAATGCA 
TTATGCCACG 
GACTGAACAG 
AGTTTCATCT 
ACTGTGTATA 
AATCCTTCTG 
TTTAGACCCC 
AACTTCTATC 
AATCCAACGT 
CTTCCAAATG 
TCCGTGGGGA 
AATGTGATGC 
CCTACAATAG 
ACATATTTAT 
CTGGAGTGCG 
TCTCCCGCCT 
ATTTTTTGTA 
CTCCTGACCT 
AGCCACCGTG 
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WO 03/003906 
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CCTGGTCCAT 
CATCAGGCTT 
AGGCGGGAGT 
GTTTGAAACT 
AAAATTGTTT 
CAAGGTGGGC 
CCCCATCTCT 
AGGCAAAGTG 
AATTCAATTA 
TTATACTAAA 
ATGCCTGTAA 
TTGAGACCAG 
GGCTAGATTT 
CAATTTCAGT 
CATAGTTGTC 
GTTTTTTTGA 
GGCTCACTGC 
AGCTGGGATT 
ACCATATTGG 
CCCCAAAGTG 
AACGCAGTTA 
AAGATTGTGA 
ATTTTATTTT 
ATTGGAGATT 
GGACTGGATT 
ATCCTTATGT 
CTTTATGGCA 
CAGGAATCTG 
GTCACTAGTA 
CAGGATTAGG 
TAGAGCAAAT 
TCAACTATGA 
GTAAAAAACA 
GCTTGGTAAA 
AGTGTAGTG 



TTATGCAATT 
TCTGAACAAA 
AGGGAAAAGT 
CACTAGTTTT 
AGGAGGCTAG 
AGGTAGCTTG 
ATTAAGAAAA 
TCTTTCTCAT 
GGCTTATTGT 
CCAGTATAAT 
TCCCAGGGCT 
CCTGACCAAC 
GTCAGAAAAG 
TTGAGTAAAC 
TGGGGCAACA 
GATGGAGTTT 
AACCTCCACC 
ACAGATGCCC 
CCAGGCTGGC 
TTGGGATTAC 
TCATTCTAGG 
TACATGGTAT 
TTAAAAGCTA 
TGAAATTAAA 
GCTATGTGAC 
TCCCTGCATG 
ACTGCCCTAT 
GTTAGCTTTG 
GCATTCTGTG 
GATAGGACTT 
GGTTCTTTAC 
TGTACAGTGC 
CTACAGATAC 
TAATTTTTTT 



TTTATTACAC 
TTTTTAGAAC 
CTGCCTTTTC 
GACCCAAATT 
CGCCGTGGCT 
AGCCCAGAAG 
AAATTTAAGG 
ATCTAATCAA 
AAGGATATCT 
TGTTTAGAAG 
TTGGAAGGCT 
ATGGAGAAAC 
AAACATTTCT 
TCTGAAATAA 
GGATGCTTGC 
TGCTCTTGTC 
TCCCAGGTTC 
GCCACAACGC 
CTCCAACTCC 
AGGCATAAGC 
AAATTTATAG 
TTCAGTTTTC 
TAGTGTCTTT 
GCCAAGATTT 
CAGGGTCCCA 
CTCACCTTTG 
TCTCTGGATC 
CTTAACACAT 
CAGTACAATG 
GAGTCTAATC 
GTCTACATTT 
TAATTCCAAT 
TGAAGCAGTT 
TTTCTTTTGA 



GTGTGTCATG 
CAAGAAGATC 
TGCCTTATAA 
CTAAATCGCC 
CATGCCTGTA 
TTTGAGACCA 
AAAAAAAAAA 
TTCCTGTATT 
TTAGTCAGTA 
TATATATTTG 
GAGGTGAGTG 
ACTGTTCTCT 
ACATGAACAG 
TTCATGGTTG 
TTTCAGAGGA 
ACCCAGGCTG 
AAGTGATTCT 
CCAGCTTATT 
TGACCTCAGG 
CACCGCGCCC 
GTCCTTTGAA 
CCAAATGTGG 
TTTTCTTAAA 
GTAGTTTCTC 
CTAGATGGGG 
TTACCTGCCT 
CTTCCTGAGG 
TTCCAAAACT 
TATGGGGGCT 
CTAACTCTTA 
TCTTCATCTG 
GAAATGTTAC 
TGGAGAATTA 
GACAGAGTCT 



GTGCTAAGCA 

AGCAATAAAA 

GTTACATTGC 

TCCAAACCAC 

ATCCCAGCAC 

GCCTGGGCAA 

AAGATAAAAT 

TTTTTTTTTT 

TAAAAATATA 

CCGGCCGGGC 

GATCACTAGA 

ACTAAGTATA 

TTAAGGCTGG 

AGGTTCAGAG 

AACTTTTTGT 

AAATGCAATG 

CCTGCCTCAG 

TTTAGTAGAG 

TGAGCCACCC 

AGCCAGAGGC 
GGAAAATTCT 

CCAGCCCGAT 
TTTGAGGCAA 
TGGAAAGACC 
CTGCATCCTC 
GACACCTGTG 
ATTTATGATG 
TGTTTGAATG 
TAGGAC'TrTA' 
GCAGTTACAC 
TAGATGTAAT 
ATGTGAGAAG 
AAAAACACTA 
GGCTCTGTCA 



TTTAAACAAC 
GAAAAAAAAA 
AGATATAAGT 
TTTCATGTTT 
TTTGTGAAGC 
CATGGCAATA 
TGTTTTGGAA 
TTTTTTTTAC 
TAAAAAACAA 
ATGGTGGCTC 
GGTCGGGAGT 
GATTTGCAAG 
GTATTAATTT 
CTACCCATAC 
TTTTGTTTTT 
GCACAATCTT 
CCTCCTGAGT 
ACAGGGTTTC 
GCTTTGGCCT 
AACATTTTTT 
GTGGGCAAAT 
CTGGTCAAAA 
CATGCACAAA 
TGGCAAGATT 
TAATCCCCAA 
GGGCTTTTAA 
CGTAATACTC 
CATGAGTACA 
GGGTAGTATA 
TGGATGACAT 
AATTTCCATA 
TCTTTGAAAT 
CGAAAACACA 
TCCAGGCTAG 



2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3160 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 



Seq ID NO: 64 Protein sequence: 
Protein Accession #: NP_036338.1 



1 
I 

MHPQWILSL 
IVWTKGTHVT 
SLEIVPPKVT 
TTTSVPTTTS 
IHREPTSSPL 
VDLALLGVII 



11 



I 

ILHLADSVAG 
YRKDTRYKU* 
TTPIVTTVPT 
IPTTTSVPVT 
YSYTTDGNDT 
AKKYFFKKEV 



21 
I 

SVKVGGEAGP 
GDLSRMJVSL 
VTTVRTSTTV 
TTVSTFVPPM 
VTESSDGLWN 
QQLSVSPSSL 



31 
I 

SVTLPCHYSG 
TIENTAVSDS 
PTTTTVPTTT 
PLPRQNHEPV 
NNQTQLPLBH 
QIKALQNAVB 



41 

I 

AVTSMCWNRG 
GVYCCRVEHR 
VPTTMSIPTT 
ATSPSSPQPA 
SLLTANTTKG 
KEVQAEDNIY 



51 

! 

SCSLPTCQNG 
GWFNDMKITV 
TTVPTTMTVS 
BTHPTTLQGA 
IYAGVCISVL 
IENSLYATD 



60 
120 
180 
240 
300 



Seq ID NO: 65 DNA sequence 

Nucleic Acid Accession B; EOS sequence 

Coding sequence: 2932-4764 



AGAAGAAAGC 
CCAGAGAAGG 
AACTGGGAGG 
GAAGGAGCAC 
GACCTGAAGT 
GCACCCATCC 
CCTGGGGAGG 
CTAGGCCAGC 
GATGAGCCTG 
GCCCTGGCCC 
ACTGTGCTGG 
GTGGGGGGGC 
GAGTACTTCT 
ATCACGGTGA 
GCAGGACTAA 
ACCCGGATAG 
TCGTCCCCTT 
CCGCATGGAC 
TGGCACTTCC 
TAGCCTCACG 
GCCCTTAGAT 
CGCOGGGACC 
CCCTCCAATG 
GAGGTGGGGA 



11 
i 

TGGTAGGGGC 
TTCCCTGAGG 
AGAGAATGTT 
AGCTGGGGTG 
ACCAGCCCCC 
AGGACAGCTT 
CCCAGCTGGA 
AGGACGATGA 
ACGTGGAACT 
ATGGGGATCC 
AAGCCCAGAA 
AGCGCCGTGT 
TGTTCGAATT 
GTGGGGTAGG 
CACCACCTTC 
GCACCTTCAG 
ATTGAGACTC 
CCCTATACAC 
GCCCCTCCCT 
GCCACCCTGT 
GGCCAGTTCT 
AAGGGTTTCA 
CTACCCCCGG 
GGAGGACATG 



21 
1 

TGGGAGAGGG 
AAATAGTACC 
CCAGGCAGAG 
CCTGGAGTGA 
AGAGGGCGCT 
CGAGTTAATC 
GCGGCGGGCA 
AGAGAATGAG 



CTTCCAGGTG 
ACTGGTGGGA 
GACCGCCACA 
TCATGACACG 
GGTGACCAGT 
CCCCAAGGCT 
GATGGACCTG 
TGCACGGACA 
TCACTTCGGA 
GCTGAGCCAG 
GCGTCCCGCC 
ACCAAAGATG 
TTAAGGTCAC 
CCCCAGGGCA 
GATCCGGGGG 



31 
I 

TACCACAGGG 
TGAACTTAGA 
GAAAAGGCAT 
GATGGAGCTG 
ACTGGAGCCT 
ATCCCCAATG 
GTGGCTCTAG 
CTGGAGCTTG 
ATGTTCAGCC 
TCCAGAGCTC 
GTCAACATTA 
CAGCGTGGGA 
CGGCTTCGTC 
GTCCTTCAGA 
TTCCATTCGC 
GGCATCATCT 
AGGGCCCTAG 
GAGGGCCATC 
AGAAGCCCTG 
GGTCGCCCCT 
GGTTCCGCTG 
CTTGTCCGTG 
CTGTTCGGAC 
TGGCCGTGGG 



41 
i 

GAGTATGATC 
CTTGAAGGAT 
ATGCAAAAGT 
GAAAGATCCA 
GGTCAGAGGA 
TGGGCTTCCA 
GCCGCAGGCT 
AGCTGGAGCA 
CCCTCAAGAG 
AAGACTTCCA 
ACCCCTATGT 
CCAGTTGCCC 
TCCAAGACTT 
GAAGGGGGGA 
AGACCCTCCC 
TGGACCAGCC 
AGATTGACCC 
TCTGGCGGAG 
GCCATTGTCC 
TACCCCTGGC 
CATGATCCCC 
AGGGOGCGCG 
ATCGAGAAGT 
GOGCGGATAA 



51 
I 

TACTTGGGGG 
AACAGATGTT 
CCAGCGCCTT 
GGTGGAGCTT 
GGACTTTGGG 
GGAACTGGAG 
AGCTCGAAGT 
GGACCTGGAT 
CCGCGCCAGG 
GGTGGGAGTC 
GGCCGTGCAA 
CTTCTACAAT 
GCTGCTGGAG 
TGAGAAAGCT 
CTTTATGGCC 
AGGTATGGAA 
TGCAGTGACT 
GCTGAACTCT 
GTCACTCCGA 
TCGCCCCTTC 
GAGACACCCG 
GGGACCTGCC 
GAGCCGGGGT 
GGGGAGGGGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 



252 
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CGAGATCCCA GTTTCTCCCC CCCCGCTCGG TGCCCCCTCC CCTAGGAACC TGCTCCTGCC 
GCGCGGGGTG CCCGCCGAGA GGCCATGGGC GCGGCTCCGC GTGCGCCTGT ACCGCGCCGA 
GGGGCTTCCC GCGCTGCGCC TGGGGCTGCT GGGCAGCCTG GTCCGCGCCC TGCACGACCA 
GCGCGTCCTG GTGGAGCCCT ATGTGCGGGT GTCTTTCCTG GGGCAGGAGG GCGAGACGTC 
5 GGTGAGCGCC GAGGCGGCGG CGCCCGAATG GAACGAGCAG CTGAGCTTCG TGGAGCTCTT 
CCCGCCGCTG ACGCGCAGCC TCCGCCTGCA GCTGCGGGAC GACGCGCCCC TGGTCGACGC 
GGCACTCGCT ACGCACGTGC CGGACCTGAG GCGGATCTCC CATCCGGGCC GCGCGGCGGG 
GTTTAACCCT ACCTTCGGCC CGGCCTGGGT GCCCCTCTAT GGCTCGCCCC CCGGCGCGGG 
GCTCCGGGAT AGTCTTCAAG GTCTCAACGA AGGCGTTGGC CAAGGCATTT GGTTCCGCGG 
10 CCGCCTTCTG CTGGCTGTGT CCATGCAGGT GTTGGAAGGG AGAGCTGAAC CTGAGCCTCC 
CCAGGCCCAG CAGGGGTCCA CGTTGTCCCG GCTCACCCGA AAGAAGAAAA AGAAAGCCAG 
AAGGGATCAG ACCCCAAAGG CGGTTCCGCA GCACTTGGAC GCCAGCCCCG GTGCCGAGGG 
GCCTGAGATC CCCCGTGCCA TGGAGGTGGA GGTGGAGGAG CTGCTGCCGC TGCCAGAGAA 
TGTCCTGGCG CCCTGTGAAG ATTTCCTGCT TTTCGGTGTG CTCTTCGAGG CCACCATGAT 
15 CGACCCCACC GTGGCCTCCC AGCCCATCAG CTTCGAGATC TCCATTGGTG TGTGGCCTAG 
CCGAACCCCT GAGTGCCATT TCAGACCTTA GAACCCTGGA AGGGGTGTTG ACTTTCAGTC 
GCGCAGGCCG TCTGGAGGAG CAATTGGGCC GAGGGTCCAG GGCTGGGGAG GGAACTGAGG 
GTGCAGCCGT GGAGGCTCAG CCTCTGCTGG GAGCCAGGCC AGAGGAGGAG AAAGAGGAGG 
AAGAACTGGG GACCCATGCT CAGCGGCCTG AGCCCATGGA CGGCAGTGGG CCATACTTCT 
20 GCTTGCCCCT CTGTCACTGC AAGCCATGCA TGCATGTGTG GAGTTGCTGG GAGGACCACA 
CCTGGCGCCT GCAGAGCAGC AACTGCGTGC GCAAAGTGGC CGAGAGGCTG GACCAGGGGC 
TGCAGGAGGT TGAGAGACTG CAGCGCAAGC CGGGGCCTGG CGCCTGTGCA CAGCTCAAGC 
AGGCACTGGA AGTACTGGTG GCTGGGAGCA GACAGTTTTG CCACGGTGCC GAGCGCAGGA 
CGATGACCCG GCCCAATGCC CTGGATCGAT GCCGAGGGAA ACTCCTGGTG CACAGCCTGA 
25 ACCTTTTGGC TAAGCAAGGA CTGCGACTTC TACGCGGCCT GAGACGGCGC AATGTGCAAA 
AGAAGGTGGC ACTGGCCAAG AAGCTCCTGG CAAAACTGCG CTTTCTGGCT GAGGAGGCAC 
CCGGGGCAGC CCCTGGTGAG GTCTGTGCCA AGCTGGAGCT CTTCCTGCGG CTGGGCCTGG 
GCAAGCAAGC CAAGGCCTGC ACCTCTGAGC TGCCCCCGGA TTTGCTGCCC GAGCCCTCAG 
CCGGGCTGCC CTCCAGCCTA CACCGGGACG GTCCTGGAGC AGACGCTGAG CCCTCTGTGG 
30 GATGAACTCC TGGTATTTGA GCAGTTGATC GTGGATGGGA GGAGGGAGCA CCTGCAGGAG 
GAGCCTCCAT TAGTGATCAT CAATGTATTT GACCACAATA AGTTTGGCCC CCCCGTGTTC 
CTGGGCAGGG CACTGGCCGC CCCAAGGGTA AAGCTGATGG AGGACCCATA CCAACGCCCA 
GAGTTGCAGT TCTTCCCCCT GAGGAAGGGA CCCTGGGCAG CCGGAGAGCT CATTGCCGCC 
TTTCAACTCA TTGAACTAGA CTACAGTGGC CGACTTGAGC CCTCAGTGCC CAGTGAGGTG 
35 GAGCCCCAGG ATCTGGCACC CCTGGTTGAG CCCCACTCTG GACGCCTGTC CCTTCCACCC 
AACGTGTGCC CAGTGCTCAG GGAGTTCCGT GTTGAGGTGC TGTTCTGGGG TCTTAGGGGA 
CTTGGTCGTG TGCATCTGCT CGAGGTGGAG CAGCCCCAGG TTGTACTGGA GGTGGCTGGG 
CAAGGTGTGG AGTCTGAGGT CCTGGCCAGC TACCGTGAGA GCCCCAATTT CACTGAGCTT 
GTCAGGCATC TGACAGTGGT CTTCAAAGAC ACAGCTCCTC TCTTCCACCC CCAGGACTTG 
40 CCGGAGCAGC CTTACTTGCA GCCTCCACTC AGCATCTTGG TGATTGAGCG CCGGGCCTTT 
GGCCACACAG TCCTTGTGGG TTCCCACATT GTCCCCCACA TGCTGCGATT CACATTTCGG 
GGTCATGAGG ATCCTCCTGA GGAGGAAGGA GAGATGGAGG AGACAGGGGA TATGATGCCC 
AAGGGACCTC AAGGACAGAA GTCCCTGGAT CCCTTCTTGG CTGAAGCGGG TATATCCAGA 
CAGCrCCTGA AGCACAACTT TGATGAAGAT GAAATGGATG ATCCTGGAGA TTCAGATGGG 
45 GTCAACCTCA TTTCTATGGT TGGGGAGATC CAAGACCAGG GTGAGGCTGA AGTCAAAGGC 
ACTGTGTCCC CAAAAAAAGC AGTTGCCACC CTGAAGATCT ACAACAGGTC CCTGAAGGAA 
GAATTTAACC ACTTTGAAGA CTGGCTGAAT GTGTTTCCTC TGTACCGAGG GCAAGGGGGC 
CAGGATGGAG GTGGAGAAGA GGAAGGATCT GGACACCTTG TC3GGCAAGTT CAAGGGCTCG 
TTCCTCATTT ACCCTGAATC AGAGGCAGTG TTGTTCTCTG AGCCCCAGAT CTCCCGGGGG 
50 ATCCCACAGA ACCGGCCCAT CAAGCTCCTG GTCAGAGTGT ATOTGTAAA GGC^CCAAC 
CTGGCTCCTG CAGACCCCAA TGGCAAAGCA GACCCTTACG TGGTGGTGAG CGCTGGCCGG 
GAGCGGCAGG ACACCAAGGA AGGCTACATC CCCAAGCAGC TCAACCCCAT CTTTGGAGAG 
ATCCTGGAGC TAAGCATCTC TCTCCCAGCT GAGACGGAGC TGACGGTCGC CCTATTTGAT 
CATGACCTCG TGGGTTCTGA CGACCTCATC GGGGAGACCC ACATTGATCT GGAAAACCGA 

55 SctaSgcc accacagagc aaactgtcgg ctogcctccc agtatgaagt gtgggtccag 
ISSSSSSi aggagccatt ctgagtttct ggccaaacac attcaagctc acattccctt 
ttgtgtctcc agatcctatg atttcatgga aggggaccct cccacccacc gccactgcca 

ACCAAGACAT AGCTCAGTGG TCAAGACTTG GGCTTGGGAG TCGGGATCCT GXAACGAATG 

tSotgacc gctttctttt tttatgaaac agtctcgctc tgtctcccag gttggagtgc 
60 ag^Scaoga tctcggctga ctgcaacctc cacctcctgg gttcaagcga ttctcctgcc 
tc^gcctccc cagtagcicg gattacaggc gtgggccccc atgtccagct aatttttata 

TTTTCGCTCT GTCTCCCAGG TTGGAGTGCA GTGGCACGAT CTCGGCTGAC TGCAACCTCC 

ACCTCCTGGG ttcaagcgat tctcctgcct cagcctcccc agtagctggg attacaggcg 

TGGGCCCGCA TGTCCAGCTA ATTTTTATAT TTTTAGTAGA GACAGGGTTT CACCATGTTG 
65 TCCAGGCTGG TCTTGAACCC CTGACCTCAA GTGATCCACC CACGTCTGCC TCCCAAAGTG 
CTGG^TTAC AGGTGTGAGC CACCATGCCA GGCCCTCTTA ACCTCTTCAA GTCTGTTTTC 
?S?CTGCAA AACAGAGGTA ATAAGATCAG TATCTTCTTA ATGGAAGCAC CTGGACTACA 
^SStcat TCATTGTTAT CATAAATGAG GACTAACCTG tctcccgttg ggagttttga 
IcS^GACcr catgtcttca toacgtcatc actgccccag gcccagctgt gtccctacac 

70 C^^AGC TGACGCATCT TCTTTTTCTG CCTGTAGAGA TGGTTACAAT GCCTGGCGTG 
ATGCATTCTG GCCTTCGCAG ATCCTGGCGG GGCTGTGCCA ACGCTGTGGC CTCCCTGCCC 
CTGAATACCG AGCCGGTGCT GTCAAGGTGG GCAGCAAAGT CTTCCTGACA CCACCGGAGA 
CC^GCCCC? AGGGATCTCT TCACATGTGG ATTGACATCT TTCCTCAAGA TGTGCCTGCT 
SSc^AG TTGACATCAA GCCTCGGCAG CCAATCAGCT ATGAGCTCAG AGTTGTCATC 

75 tgGAACACGG AGGATGTGGT TCTGGATGAC GAGAATCCAC TCACCGGAGA GA^CGAGT 

gacatctatg tgaagagctg ggtgaagggg ttggagcatg acaagcagga gacagacgtt 
Sotcaact ccctgactgg ggaggggaac ttcaattggc gctttgtgtt ccgctttgac 

ScC^CA CGGAGCGGGA GGTGAGCGTC TGGCGCAGGT CTGGACCCTT TGCCCTGGAG 

253 
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GAGGCGGAGT TCCGGCAGCC TGCAGTGCTG GTCCTGCAGG ATCCCTGGAG TTGCAGCTAC 6180 

CAGACATGGT GCGTGGGGCC CGGGGCCCCG AGCTCTGCTC TGTGCAGCTG GCCCGCAATG 6240 

GGGCCGGGCC GAGGTGCAAT CTGTTTCGCT GCCGCCGCCT GAGGGGCTGG TGGCCGGTAG 6300 

TGAAGCTGAA GGAGGCAGAG GACGTGGAGC GGGAGGCGCA GGAGGCTCAG GCTGGCAAGA 6360 

AGAAGCGAAA GCAGAGGAGG AGGAAGGGCC GGCCAGAAGA CCTGGAGTTC ACAGACATGG 6420 

GTGGCAATGT GTACATCCTC ACGGGCAAGG TGGAGGCAGA GTTTGAGCTG CTGACTGTGG 6480 

AGGAGGCCGA GAAACGGCCA GTGGGGAAGG GGCGGAAGCA GCCAGAGCCT CTGGAGAAAC 6540 

CCAGCCGCCC CAAAACTTCC TTCAACTGGT TTGTGAACCC GCTGAAGACC TTTGTCTTCT 6600 

TCATCTGGCG CCGGTACTGG CGCACCCTGG TGCTGCTGCT ACTGGTGCTG CTCACCGTCT 6660 

TCCTCCTCCT GGTCTTCTAC ACCATCCCTG GCCAGATCAG CCAGGTCATC TTCCGTCCCC 6720 

TCCACAAGTG ACTCTCGCTG ACCTTGGACA CTCACCCAGG GTGCCAACCC TTCAATGCCT 6780 

GCTCCTGGAA GTCTTTCTTA CCCATGTGAG CTACCCCAGA GTCTAGTGCT TCCTCTGAAT 6840 
AAACCTATCA CAGCC 



Seq ID NO: 66 Protein sequences 
Protein Accession #: EOS sequence 

1 11 21 31 41 51 

1 | | t I I 

MCKRRWHWPR SSWQNCAFWL RRHPGQPLVR SVPSWSSSCG WAWASKPRPA PLSCPRICCP 60 

SPQPGCPPAY TGTVLBOTLS PLWDELLVFE QLIVDGRREH UQEEPPLVII NVFDHNKFGP 120 

PVFLGRALAA PRVKLMEDPY QRPELQFFPL RKGPWAAGEL IAAFQLIELD YSGRLEPSVP 180 

SEVEPQDLAP LVEPHSGRLS LPPNVCPVLR EFRVEVLFWG LRGLGRVHLL EVEQPQWLE 240 

VAGQGVESEV LASYRESPNF TELVRHLTW FKDTAPLFHP QDLPEQPYLQ PPLSILVIER 300 

RAFGHTVLVG SHIVPHMLRF TFRGHEDPPE EEGEMEETGD MMPKGPQGQK SLDPFLAEAG 360 

ISRQLLKHNF DEDEMDDPGD SDGVNLISMV GEIQDQGEAE VKGTVSPKKA VATLKIYNRS 420 

UCEEFNHFED WLNVFPLYRG QGGQDGGGEE EGSGHLVGKF KGSFLIYPES EAVLFSEPQI 480 

SRGIPQNRPI KLLVRVYWK ATNLAPADPN GKADPYWVS AGRERQDTKE RYIPKQLNPI 540 

FGEILELSIS LP AFT EL TV A VPDHDLVGSD DLIGETHIDL ENRFYSHHRA NCGLASQYEV 600 
WVQQGPQEPF 



Seq ID NO: 67 DNA sequence 

Nucleic Acid Accession #: NM_002449.2 

Coding sequence: 223.. 1026 

1 11 21 31 41 51 

GGGGGGGGGG GGCAGCCTCT CGGGAAGAGC CAATCAGGGG CGAGCGTCTT CTCGTCGCAC 60 

GAGGCCCGGC GCGGATTGGC GGCGCGCGTC TCCCACTTCC CCTCGGAGGA AAGGCTCAGC 120 

TCCCAGCGCG CCCCTCCCGT CTCCGCAGCA AAAAAGTTTG AGTCGCCGCT GCCGGGTTGC 180 

CAGCGGAGTC GCGCGTCGGG AGCTACGTAG GGCAGAGAAG TCATGGCTTC TCCGTCCAAA 240 

GGCAATGACT TGTTTTCGCC CGACGAGGAG GGCCCAGCAG TGGTGGCCGG ACCAGGCCCG 300 

GGGCTGGGGG GCGCCGCGGG GGCCGCGGAG GAGCGCCGCG TCAAGGTCTC CAGCCTGCCC 360 

TTCAGCGTGG AGGCGCTCAT GTCCGACAAG AAGCCGCCCA AGGAGTCGCC CGCTGTGCCT 420 

CCCGAAGGCG CCTCGGCCGG GGCCCACCTG CGGCCACTGC TGCTGTCGGG GCACTOCGCT 480 

CGGGAAGCGC ACAGCCCCGG GCCGCTGGTG AAGCCCTTCG AGACCGCCTC GGTCAAGTCG 540 

GGAAATTCAG AAGATGGAGC GGCGTGGATG CAGGAACCCG GCCGATATTC GCCGCCGCCA 600 

AGACATATGA GCCCTACCAC CTGCACCCTG AGGAAACACA AGACCAATCG GAAGCCGCGC 660 

ACGCCCTTTA CCACATCCCA GCTCCTCGCC CTGGAGCGCA AGTTCCGTCA GAAACAGTAC 720 

CTCTCCATTG CAGAGCGTGC AGAGTTCTCC AGCTCTCTGA ACCTCACAGA GACCCAGGTC 780 

AAAATCTGGT TCCAGAACCG AAGCGCCAAG GCGAAAAGAC TGCAGGAGGC GGAACTGGAA 840 

AAGCTGAAAA TGGCTGCAAA ACCTATGCTA CCCTCCAGCT TCAGTCTCCC CTTCCCCATC 900 

AGCTCGCCCC TGCAGGCAGC GTCCATATAC GCAGCATCCT ACCCGTTCCA TAGACCTGTG 960 

CTTCCCATCC CGCCCGTGGG ACTCTATGCC ACGCCAGTGG GATATGGCAT ^ACCACCTG 1020 

TCCTAAGGAA GACCAGATCA ATAGACTCCA TGATGGATGC TTGTTTCAAA GGGTTTCCTC 1080 

TCCCTCTCCA CAAAGGCATA GCCAGCCAGT ACTCCTGCGC TGCTAAGCCC TCGACGTTGC 1140 

ACCCCACCCC CTCTAACGGC TAGCTGACAG GGCCACACCA CATAGCTGAA AITTCGTTCT 1200 

GTAGGCGGAG GCACCAAGCC CTGCTTTTCT TGGTGT AACT TCCAGAGTCC CGCCTTTTTT 1260 

CCCTTGCACA AAAGCTTGGC TCTGATGGTT TTTTTGGCAT GATGTATATA TATATATACG 1320 

AAAAATACTA CAGACCCTTT TTATCAGCAG ACGTAAAAAT TCAAATTATT TTAAAAGGCA 1380 

AAATTTATAT ACATATGTGC TTTTTTTCTA TATCTCACCT TCCCAAAAAG ACACATGTGT 1440 
^TCCA^ GTTGTATTTT CTTAAAGAGG GAGACAAATT CGGAGGAGCG CCGOTTCAAG 500 
GTCTCCAGCC TGCCCTTCAG CGTGGAGGCG CTCATGTCCG ATTTGCAAAA AT^CTAAA 1560 
GTCAATGATT TTTACCGGGA TTATTGACTT CTGCTTATAC AAGAAGCCGC CCAAGGAGTC 1620 
CCTCCCGAAG GCGCCTCGGC CGGCCTGCGG AAAAACAAAA GA^CAGAC 1680 
ACAATGCAGC AGCCAGAAAA TATTAGATAT GGAGAGATTA TGGCCACTGC TGCTGACOGG 1740 
CCACGGOGTC CGGGAAGCGC ACAGCCCCGG GCCGCTGGTG ATCAAAGTGA ACCCACATCA 1800 
TATTTCTGCA TTTTACTTGC ATTAAAAGAA ACCTCTTTAT AAGCCCTTCG AGACCGCCTC 1860 
GGTCAAGTCG GGAAATTCAG AAGATGGAGC GGCGTGGATG CTACATACGT TGTTCCTATC 1920 
T^CGCCCAC GCCCACACAT ATTTTTAAAG TTTTTAGGAA CCCGGCCGAT ATTCGCCGCC 1980 
GCCAAGACAT ATGAGCCCTA CCACCTGCAC CCTGACCTTT TTAAGAATAT TTTTGTAAGA 2040 
CCAATACCTG GGATGAGAAG AATCCGTAGA CTGCCGGAAA CACAAGACCA ATCGGAAGCC 2100 
GCGCACGCCC TTTACCACAT CCCAGCTCCT CGCCCTGGAG GTGAGGTAGA AAAATTAGAA 2160 
ATACTTCCTA ATTCTTCTCA AGGCTGTTGG TAACTTTGGA GCGCAAGTTC CGTCAGAAAC 2220 
AGTACCTCTC CATTGCAGAG CGTGCAGAGT TCTCTATTTC AGATAATTGG AGAGTAAAAT 2280 
GTTAAAACCT GTGAGAGGAT TGTACAGCTC TCTGAACCTC ACAGACCCAG GTCAAAAGGT 2340 
TCTGAGAAAT ACTAGGTACA TTCATCCTCA CAGATTGCAA AGGTGCTTTG GGTGGGGGTT 2400 
TAGTAATTTT CTGCTTAAAA AATGAGTATC TTGTAACCAT TACCTATATC TAAATATTCT 2460 
TGAACAATTA GTAGATCCAG AAAGAAAAAA AAAATATGCT TCTCTGTGTG TGTACCTGTT 2520 

254 
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GTATGTCCTA ACTTATTAGA AAAATTTTAT ATCTTTTTAC ATGTGGGGGG CAGAAGGTAA 2580 

AGCATGTTTG ACTTGTGAAA ATGGGATGTC AAACAGCCAT AAGTTCCCTG GTATTCACCT 2 640 

TCCTGTCCAT CTGTCCCCTC CATCGGTATA CCTTTATCCC TTTGAAAGGG TGCTTGTACA 2700 

ATTTGATATA TTTTATTGAA GAGTTATCTC TTATTCTGAA TTAAATTAAG CATTTGTTTT 2760 
ATTCTGAATT AAATTAAGCA TTTGTTTTAT TGCAGTAAAG TTTGTCCA 

Seq ID NO: 68 Protein Sequence 
Protein Accession 8: NP_002440.1 

1 11 21 31 41 51 

1 1 I I I I 

MASPSKGNDL FSPDEEGPAV VAGPGPGLGG AAGAAEBRRV KVSSLPFSVE ALMSDKKPPK 60 

ESPAVPPEGA SAGAHLRPLL LSGHRAREAH SPGPLVKPFE TASVKSGNSE DGAAWMQEPG 120 

RYSPPPRHMS PTTCTLRKHK TNRKPRTPFT TSQLLALERK PRQKQYLSIA ERABFSSSLN 180 

LTETQVKIWP QNRSAKAKRL QEAELEKLKM AAKPMLPSSF SLPFPISSPL QAASIYAASY 240 
PFHRPVLPIP PVGLYATPVG YGMYHLS 



Seq ID NO i 69 DNA sequence 

Nucleic Acid Accession #: NM_005603.1 

Coding sequence: 1..3 756 



11 
i 



21 



ATGAGTACAG 
GTGGTTCCCT 
GAACCAGAAC 
GAATGTACAT 
AACACAAAAT 
AAGTACAACG 
AATTTATATT 
TGGTACACCA 
GTGGACGATG 
ATTAAGGATG 
CGTCTGAAAA 
AACAGCCTCT 
ATGTCACTTG 
GGTTTTATTG 
TGGAGAAACA 
AGGAACACCG 
AAGAATAGTG 
GTTTACACGA 
TATTGGGAAG 
CCCTCCTACC 
CCCATCTCTC 
TGGGACCTGC 
CTCAATGAAC 
CAAAATATCA 
GATGCCTCTC 
GCTGATGGGA 
GAGCCAGAAG 
AGGACTGATG 
GCTGCCAGGA 
GAACTGGGCA 
AAGCGAATGT 
GCTGACACTG 
GATGCCCTGG 
ATTGAAGAAA 
ACCAACCGGG 
CTGGGAGCTA 
CTTGCAAAAG 
AATATAGGAT 
ATTAATTCTC 
AAGTTTGCAC 
ATCACTGGTT 
CTGAAGCTGA 
AGGCTAGAAG 
AGCGCAGTCA 
AAGAGGTACA 
ATCAAAACTG 
TCGAGTGACT 
CGATGGTCTT 
TTTACTTTGG 
GAGGATTGGT 
GGGCTGCTCG 
GTGGGACAAA 
GTCCTAACAT 
CAGGATGGAG 



AAAGAGACTC 
ACAGTGATGA 
AAAACCGAGT 
GGCAAGTCAA 
TCTTGTGTAT 
CATTTACCTT 
TCCTGGCTCT 
CACTAGTGCC 
TGGCTCGCCA 
GCAGGTTCAA 
AAAATGATTT 
GCTATGTGGA 
AAATCACAGA 
AATGTGAAGA 
CAAGTTTTCC 
ATTTCTGCCA 
GGAAAACCAG 
TCTTTGTTGT 
CACAGGTGGG 
GTGGATTCCT 
TCTATGTCAG 
AAATGTACTA 
AGCTCGGGCA 
TGACCTTTAA 
AACACAACCA 
AGCTTGCATT 
TAOGACAGTT 
GTCAGCTCAA 
ACTTTGGCTT 
CTGAAAGGAC 
CTATCATTGT 
TTATTTATGA 
ATATCTTTGC 
AAGAATTTAC 
ACGAAGCTCT 
CAGCTATTGA 
CTGACATTAA 
TTGCTTGTGA 
TTCTTCATGC 
CTCCTGTGCA 
CTTGGTTGAA 
AGTTCCCAAG 
CTAAGAAAGA 
TCTGCTGCCG 
AGAAAGCCAT 
CCCACATTGG 
ATTCCTTTGC 
ACATAAGGAT 
TTCATTTCTG 
TCATCACCCT 
ACCAGGATGT 
GAGACTTACT 
CGATGATCCT 
AGGCACCTTC 



AGAAACGACA 
TGAAACAGAA 
CAACAGGGAA 
AGCAAACGAT 
TAAGGAGAGT 
TATACCAATG 
TCTTATCTTA 
CCTGCTTGTG 
TAAAATGGAT 
AGTTGCTAAG 
TGTTCCAGCT 
AACAGCAGAA 
CCAGTACCTC 
ACCCAATAAC 
TTTGGATGCT 
CGGCTTAGTC 
ATTTAAAAGA 
TCTTATTCTG 
CAATTCCTCT 
CATTTTCTGG 
CGTGGAAGTG 
TGCTGAGAAG 
GATCCATTAT 
AAAGTGCTGT 
CAACAAAATA 
TTATGACCAC 
CTTCTTCTTG 
CTACCAGGCA 
TGCCTTCCTC 
TTACAATGTT 
AAGAACCCCA 
ACGGTTACAT 
AAATGAAACT 
AGAATGGAAT 
GGATAAAGTA 
AGACAAGCTA 
GATCTGGGTG 
ACTTCTGACT 
AAGGATGGAA 
GGAATCTTTT 
TGAAATTCTT 
AACAGAAGAA 
GCAGCGGCAG 
CGTCACCCCC 
CACGCTGGCC 
CGTTGGAATA 
TCAGTTCCGA 
GTGCAAGTTC 
GTACTCCTTC 
CTACAACGTG 
GAGTGACAAA 
ATTCAACTAT 
CTTCTTCATA 
CGACTACCAG 



31 
I 

TTTGACGAGG 
GATGAACTTG 
GCAGAGGAGA 
CGCAAGTACC 
AAATATGCGA 
AATCTGT TTG 
CAGGCAGTTC 
GTGCTGGGCG 
AAGGAAATCA 
TGGAAAGAAA 
GACATTCTCC 
CTGGACGGAG 
CAAAGAGAAG 
CGACTAGATA 
GATAAAATTT 
ATTTTTGCAG 
ACTAAAATTG 
CTTTCTGCTG 
TGGTACCTCT 
GGCTATATCA 
ATTCGTCTTG 
GACACACCCG 
ATCTTCTCTG 
ATCAACGGGC 
GAGCAAGTTG 
TATCTTATTG 
CTCGCAGTTT 
GCCTCTCCCG 
GCCAGGACCC 
CTTGCCATTT 
GAAGGCAATA 
CGAATGAATC 
CTTAGAACCC 
AAAAAGTTTA 
TATGAGGAGA 
CAGGATGGAG 
CTTACTGGAG 
GAAGACACCA 
AACCAGAGGA 
TTTCCACCCG 
CTCGAGAAAA 
GAAAGACGGA 
AAAAACTTTG 
AAGCAGAAGG 
ATCGGAGATG 
AGTGGACAAG 
TATCTGCAGA 
CTACGATACT 
TTCAATGGCT 
CTGTACACCA 
CTGAGCCTCC 
AAGAGATTCT 
CCTCTTGGAG 
TCTTTTGCCG 



41 



51 

I 



I 

ATTCTCAGCC 
ATGACCAGGG 
ACCGGGAGCC 
ACGAACAACC 
ATAATGCAAT 
AGCAGTTTAA 
CTCAAATCTC 
TCACTGCAAT 
ACAATAGGAC 
TTCAAGTTGG 
TGCTGTCTAG 
AAACCAATTT 
ATACATTGGC 
AGTTTACAGG 
TGTTACGTGG 
GTGCTGACAC 
ATTACTTGAT 
GTCTTGCCAT 
ATGATGGAGA 
TTGTTCTCAA 
GACAGAGTCA 
CAAAAGCTAG 
ATAAGACGGG 
AGATATATGG 
ATTTTAGCTG 
AGCAAATCCA 
GCCACACAGT 
ATGAAGGTGC 
AGAACACCAT 
TGGACTTCAA 
TCAAGCTTTA 
CTACTAAGCA 
TATGCCTTTG 
TGGCTGCCAG 
TTGAAAAAGA 
TTCCAGAAAC 
ACAAAAAGGA 
CCATCTGCTA 
ATAGAGGTGG 
GTGGAAACCG 
AGACCAAGAG 
TGCGGACCCA 
TGGACCTGGC 
CCATGGTGGT 
GGGCCAATGA 
AAGGAATGCA 
GGCTACTGCT 
TCTTTTACAA 
ACTCTGCGCA 
GCCTGCCCGT 
GATTCCCTGG 
TTGTAAGCTT 
CTTATCTGCA 
TCACCATTGC 



TAATGACGAA 
GTCTGCTGTT 
ATTCAGAAAA 
TCACTTTATG 
TAAAACATAC 
GAGAGCAGCC 
TACCCTGGCT 
CAAAGACCTG 
GTGTGAAGTC 
AGACGTCATT 
CTCTGAGCCT 
AAAATTTAAG 
TACATTTGAT 
AACACTATTT 
CTGTGTAATT 
TAAAATAATG 
GAACTACATG 
CGGCCATGCT 
AGACGATACA 
CACCATGGTA 
CTTCATCAAC 
AACCACCACA 
GACACTCACA 
GGACCATCGG 
GAATACATAT 
GTCAGGGAAA 
CATGGTGGAT 
CCTGGTAAAC 
CACCATCAGT 
CAGTGACCGO 
CTGTAAAGGT 
AGAAACACAG 
CTACAAGGAA 
TGTGGCCTCC 
CTTAATTCTC 
CATTTCAAAA 
AACTGCTGAA 
TGGGGAGGAT 
CGTCTACGCA 
TGCCTTAATC 
AAATAAGATT 
AAGTAAAAGG 
CTGCGAGTGC 
GGACCTGGTG 
CGTGAACATG 
AGCTGTCATG 
GGTGCATGGC 
AAACTTTGCC 
GACTGCATAC 
GCTCCTCATG 
GTTATACATA 
GTTGCATGGG 
AACCGTAGGG 
CTCTGCTCTT 



60 

120 

180 

240 

300 

360 

420 

480 

540 
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900 

960 
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1080 

1140 
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1500 
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1B00 
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1920 
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2100 

2160 
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2340 

2400 

2460 
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2700 
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2880 

2940 

3000 
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3120 

3180 

3240 
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10 



GTAATAACAG TCAATTTCCA GATTGGCTTG GATACTTCTT ATTGGACTTT TGTGAATGCT 3300 

TTTTCAATTT TTGGAAGCAT TGCACTTTAT TTTGGCATCA TGTTTGACTT TCATAGTGCT 3360 

GGAATACATG TTCTCTTTCC ATCTGCATTT CAATTTACAG GCACAGCTTC AAACGCTCTG 3420 

AGACAGCCAT ACATTTGGTT AACTATCATC CTGACTGTTG CTGTGTGCTT ACTACCCGTC 3480 

GTTGCCATTC GATTCCTGTC AATGACCATC TGGCCATCAG AAAGTGATAA GATCCAGAAG 3540 

CATCGCAAGC GGTTGAAGGC GGAGGAGCAG TGGCAGCGAC GGCAGCAGGT GTTCCGCCGG 3600 

GGCGTGTCAA CGCGGCGCTC GGCCTACGCC TTCTCGCACC AGCGGGGCTA CGCGGACCTC 3660 

ATCTCCTCCG GGCGCAGCAT CCGCAAGAAG CGCTCGCCGC TTGATGCCAT CGTGGCGGAT 3720 
GGCACCGCGG AGTACAGGCG CACCGGGGAC AGCTGA 



Seq ID NO: 70 Protein Sequence 
Protein Accession ft: NP_O0S594.l 



15 1 11 21 31 41 51 

i I I I 1 I 

MSTERDSETT FDEDSQPNDE WPYSDDETE DELDDQGSAV EPEQNRVNRE AEENREPFRK 60 

ECTWQVKAND RKYHEQPHFM NTKPLCIKBS KYANNAIKTY KYNAPTFIPM NLFEQPKRAA 120 

NLYFLALLIL QAVPQISTLA WYTTLVPLLV VLGVTAIKDL VDDVARHKMD KEINNRTCEV 180 

20 IKDGRFKVAK WKEIQVGDVI RLKKNDFVPA DILLLSSSEP NSLCYVETAE LDGETNLKFK 240 

MSLEITDQYL QREDTLATFD GFIECEBPNN RLDKFTGTLF WRNTSFPLDA DKILLRGCVI 300 

RNTDFCHGLV IFAGADTKIM KNSGKTRFKR TXIDYLMNYM VYTIFWLIL LSAGLAIGHA 360 

YWEAQVGNSS WYLYDGEDDT PSYRGFLIFH GYIIVLNTMV PISLYVSVEV IRLGQSHFIN 420 

WDLQMYYAEK DTPAKARTTT LNEQLGQIHY IFSDKTGTLT QNIMTFKKCC INGQIYGDHR 480 

25 DASQHNHNKI EQVDFSWNTY ADGKLAFYDH YLIEQIQSGK EPEVRQFPFL LAVCHTVMVD 540 

RTDGQLNYQA ASPDEGALVN AARKFGFAFL ARTQNTITIS ELGTERTYNV LAI LD FN SDR 600 

KRMSIIVRTP EGNIKLYCKG ADTVIYERLH RMNPTKQETQ DALDIFANET LRTIrCLCYKE 660 

IEEKEFTEWN KKFMAASVAS TNRDEALDKV YBEIEKDLIL LGATAIEDKL QDGVPETISK 720 , 

LAKADIKIWV LTGDKKETAE NIGFACELLT EDTTICYGED INSLLHARMB NQRNRGGVYA 780 

30 KFAPPVQESF FPPGGNRALI ITGSWLNBIL LEKKTKRNKI LKLKFPRTEE ERRMRTQSKR 840 

RLEAKKEQRQ KNFVDLACEC SAVICCRVTP KQKAMWDLV KRYKKAITLA IGDGANDVNM 900 

IKTAHIGVGI SGQEGMQAVM SSDYSFAQFR YLQRIjLLVHG RWSYIRMCKF LRYFFYKNFA 960 

FTLVHFWYSF FNGYSAQTAY EDWFITLYNV LYTSLPVLLM GLLDQDVSDK LSLRFPGLYI 1020 

VGQRDLLFNY KRFFVSLLHG VLTSMILFFI PLGAYLOTVG QDGEAPSDYQ SFAVTIASAL 1080 

35 VITVNFQIGL DTSYWTFVNA FSIFGSIALY FGIMFDFHSA GIHVLFPSAF QFTGTASNAL 1140 

RQPYIWLTII LTVAVCLLPV VAIRFLSMTI WPSESDKIQK HRKRLKAEEQ WQRRQQVFRR 1200 
GVSTRRSAYA FSHQRGYADL ISSGRSIRKK RSPLDAIVAD GTAEYRRTGD S 

Seq ID NO: 71 DNA sequence 
40 Nucleic Acid Accession tt: NM_13B784.1 
Coding sequence: 334-816 

1 11 21 31 41 51 

45 GGGGATACGG AAAACAGAAT AGAAGCACAG AACAAGGACT ATTGTATATA AGAGAAGCAG 60 

GTGTTAATGT AAGTTGCTGT GGAATGACTG TTTTCAAGAA GAGAGAAGGA GCCAATATAT 120 

CTACTTTCAT TAAGAAGCTG AAATTAAATT TGGATTTCCT TCTTTCCAAA GCTGAGACAC 180 

GATGGCCAAA AAGCGATTGC TGTGATTGGA GCTGGAATTA GCGGACTGGG GGCCATCAAG 240 

TGCTGCCTGG ATGAAGATCT GGAGCCCACC TGCTTTGAAA GAAATGATGA TATTGGACAT 300 

50 CTCTGGAAAT TTCAAAAAAA TACTTCAGAG AAAATGCCTA GTATCTACAA ATCTGTGACC 360 

ATCAATACTT CCAAGGAGAT GATGTGCTTC AGTGACTTCC CTGTCCCTGA TCATTTTCCC 420 
AACTACATGC ACAACTCCAA ACTCATGGAC TACTTCGGGA TGTATGCCAC ACACTTTGGC 480 
CTCCTGAATT ACATTCGTTT TAAGACTGAA GTGCAAAGTG TGAGGAAGCA CCCAGATTTT 540 
TCTATCAATG GACAATGGGA TGTTGTTGTG GAGACTGAAG AGAAACAAGA GACTTTGGTC 600 
55 TTTGATGGGG TCTTAGTTTG CAGTGGACAC CACACAGATC CCTACTTACC ACTTCAGTCC 660 
TTCCCAGGCA TTGAGAAATT TGAAGGCTGT TATTTCCATA GTCGGGAATA CAAAAGTCCC 720 
GAGGACTTTT CAGGGAAAAG AATCATAGTG ATCGGCATTG GAAATTCTGG AGTGGATATT 780 
GCGGTGGAGC TCAGTCGTGT AGCAAAACAG ATTTGACCAG TAGGAACTGA GATTCAACGT D ^ rt 
CCCCAAGACT GCAAAAGTAC CTACAATCAA ACAAATTAAT ATATTTCCTA ATTGGATCAG 
60 CATGCAGACA AGTTTACAAA TATATTACAG AAAGATTTGA AGAGGGATTC TGGTTACGTT 

CAGTTAAGCA ggacactaat cctgaacaat ctgatgtatt AAAGTTTCTA ATGTTTGTAT 1020 

CTTTGGTTTG ATGCATACAA TAGAGTGGAA GTCTGTAGTA GTTAATATGA TGATAATTCG 1080 

CTTCTAAATT TAAATCCTTA TTTATATATG AAGCATAGCT ATGTCAATGT CCAGGGAAGC 1140 

ATTTTTAAAT AGTAAATCAG AAACATGTAA ACTAGTGAAT ACAATTTTCT ATTTTTCATT 1200 

65 CAGAAATTTG TTGTCATAGA TGCAGCCTTT ATCTGGAAAT CAGTGGGCAG CTCTGGTCAA 1260 

GCACAGCATG GCGACCACTG TGTAGTATTT TACCACCTAA TCCTTAGTAG CC^CACTGT 1320 

TAAATCTGCA TAATGGGTAG CATATGAATA AAATAATATC TTGTAAAAAA AAAAAAAAAA 1380 
AAAAAAAAAA AAAA 

70 Seq ID NO: 72 Protein sequence: 
Protein Accession ft: NP_620139.1 

i i 1 r r r r 

75 MPSIYKSVTI NTSKE^4MCFS DFPVPDHFPN YMHNSKLMDY FGMYATHFGL LNYIRFKTEV 60 
QSVRKHPDFS INGQWDWVE TEEKQETLVF DGVIjVCSGHH TDPYLPUJSF PGIEKFEGCY 120 
FHSREYKSPE DFSGKRIIVI GIGNSGVDIA VELSRVAKQI 



840 
900 
960 
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Seq ID NO: 73 DNA sequence . 

Nucleic Acid Accession 8: NM__003661.1 

Coding sequence: 1..1152 

1 11 21 31 41 51 

I I I I I I 

ATGAGTGCAC TTTTCCTTGG TGTGGGAGTG AGGGCAGAGG AAGCTGGAGC GAGGGTGCAA 60 

CAAAACGTTC CAAGTGGGAC AGATACTGGA GATCCTCAAA GTAAGCCCCT CGGTGACTGG 120 

GCTGCTGGCA CCATGGACCC AGAGAGCAGT ATCTTTATTG AGGATGCCAT TAAGTATTTC 180 

AAGGAAAAAG TGAGCACACA GAATCTGCTA CTCCTGCTGA CTGATAATGA GGCCTGGAAC 240 

GGATTCGTGG CTGCTGCTGA ACTGCCCAGG AATGAGGCAG ATGAGCTCCG TAAAGCTCTG 300 

GACAACCTTG CAAGACAAAT GATCATGAAA GACAAAAACT GGCACGATAA AGGCCAGCAG 360 

TACAGAAACT GGTTTCTGAA AGAGTTTCCT CGGTTGAAAA GTGAGCTTGA GGATAACATA 420 

AGAAGGCTCC GTGCCCTTGC AGATGGGGTT CAGAAGGTCC ACAAAGGCAC CACCATCGCC 480 

AATGTGGTGT CTGGCTCTCT CAGCATTTCC TCTGGCATCC TGACCCTCGT CGGCATGGGT 540 

CTGGCACCCT TCACAGAGGG AGGCAGCCTT GTACTCTTGG AACCTGGGAT GGAGTTGGGA 600 

ATCACAGCCG CTTTGACCGG GATTACCAGC AGTACCATGG ACTACGGAAA GAAGTGGTGG 660 

ACACAAGCCC AAGCCCACGA CCTGGTCATC AAAAGCCTTG ACAAATTGAA GGAGGTGAGG 720 

GAGTTTTTGG GTGAGAACAT ATCCAACTTT CTTTCCTTAG CTGGCAATAC TTACCAACTC 780 

ACACGAGGCA TTGGGAAGGA CATCCGTGCC CTCAGACGAG CCAGAGCCAA TCTTCAGTCA 840 

GTACCGCATG CCTCAGCCTC ACGCCCCCGG GTCACTGAGC CAATCTCAGC TGAAAGCGGT 900 

GAACAGGTGG AGAGGGTTAA TGAACCCAGC ATCCTGGAAA TGAGCAGAGG AGTCAAGCTC 960 

ACGGATGTGG CCCCTGTAAG CTTCTTTCTT GTGCTGGATG TAGTCTACCT CGTGTACGAA 1020 

TCAAAGCACT TACATGAGGG GGCAAAGTCA GAGACAGCTG AGGAGCTGAA GAAGGTGGCT 1080 

CAGGAGCTGG AGGAGAAGCT AAACATTCTC AACAATAATT ATAAGATTCT GCAGGCGGAC 1140 
CAAGAACTGT GA 



Seq ID NO: 74 Protein Sequence 
Protein Accession #: NP_003652.1 

1 11 21 31 41 51 

MSALFTjGVGV raeeagarvq QNVPSGTDTG DPQSKPLGDW AAGTMDPESS IFIEDAIKYF 60 

KEKVSTQNLL LLLTDNEAWN GFVAAAELPR NEADELRKAL DNLARQMIMK DKNWKDKGQQ 120 

YRNWFLKEPP RLKSELEDNI RRLRALADGV QKVHKGTTIA NWSGSLSIS SGILTLVGMG 180 

LAP FT EGG SL VLLEPGMELG ITAALTGITS STMDYGKKWW TQAQAHDLVI KSLDKLKEVR 240 

EFLGENISNP LSLAGNTYQL TRGIGKDIRA LRRARANLQS VPHASASRPR VTEPISAESG 300 

EOVBRVNEPS ILEMSRGVKL TDVAPVSFFL VLDWYLVYE SKHLHEGAKS ETAEELKKVA 360 
QELEEKLNIL NNNYKILQAD QEL 



Seq ID NO: 75 DNA sequence 

Nucleic Acid Accession ft: NM_0144S2.1 

Coding sequence: 1..1968 



60 
120 



1 11 21 31 41 51 

Itggggacct ctccgagcag CAGCACCGCC CTCGCCTCCT gcagccgcat cgcccgccga 

GCCACAGCCA CGATGATCGC GGGCTCCCTT CTCCTGCTTG GATTCCTTAG CACCACCACA 

GCTCAGCCAG AACAGAAGGC CTCGAATCTC ATTGGCACAT ACCGCCATGT TGACCGTGCC 180 

ACCGGCCAGG TGCTAACCTG TGACAAGTGT CCAGCAGGAA CCTATGTCTC TGAGCATTGT 240 

ACCAACACAA GCCTGCGCGT CTGCAGCAGT TGCCCTGTGG GGACCTTTAC CAGGCATGAG 300 

AATGGCATAG AGAAATGCCA TGACTGTAGT CAGCCATGCC CATGGCCAAT GATTGAGAAA 360 

TTACCTTGTG CTGCCTTGAC TGACCGAGAA TGCACTTGCC CACCTGGCAT GTTCCAGTCT 420 

AACGCTACCT GTGCCCCCCA TACGGTGTGT CCTGTGGGTT GGGGTGTGCG GAAGAAAGGG 480 

ACAGAGACTG AGGATGTGCG GTGTAAGCAG TGTGCTCGGG GTACCTTCTC AGATGTGCCT 540 

OTAgS T^AATGCAA AGCATACACA GACTGTCTGA GTCAGAACCT GGTGGTGATC 600 

AAGCCGGGGA CCAAGGAGAC AGACAACGTC TGTGGCACAC TCCCGTCCTT CTCCAGCTCC 660 

ACCTCACCTT CCCCTGGCAC AGCCATCTTT CCACGCCCTG AGCACATGGA AACCCATGAA 720 

GTCCCTTCCT CCACTTATGT TCCCAAAGGC ATGAACTCAA CAGAATCCAA CTCTTCTGCC 780 

TCTGTTAGAC CAAAGGTACT GAGTAGCATC CAGGAAGGGA CAGTCCCTGA CAACACAAGC 840 

TCAGCAAGGG GGAAGGAAGA CGTGAACAAG ACCCTCCCAA ACCTTCAGGT AGTCAACCAC 900 

CAGCAAGGCC CCCACCACAG ACACATCCTG AAGCTGCTGC CGTCCATGGA GGCCACTGGG 960 

GGCGAGAAGT CCAGCACGCC CATCAAGGGC CCCAAGAGGG GACATCCTAG ACAGAACCTA 1020 

CACAAGCATT TTGACATCAA TGAGCATTTG CCCTGGATGA TTGTGCTTTT CCTGCTGCTG 1080 

QTGCTTGTGG TGATTGTGGT GTGCAGTATC CGGAAAAGCT CGAGGACTCT GAAAAAGGGG 1140 

c^ogS a?Scagtcc CATTGTGGAA AAGGCAGGGC TGAAGAAATC CATGACTCCA 1200 

ACCCAGAACC GGGAGAAATG GATCTACTAC TGCAATGGCC ATGGTATCGA TATCCTGAAG 1260 

CTTGTAGCAG CCCAAGTGGG AAGCCAGTGG AAAGATATCT ATCAGTTTCT TTGCAATGCC 1320 

St^GAGGG AGGTTGCTGC TTTCTCCAAT GGGTACACAG CCGACCACGA GCGGGCCTAC 1380 

GCAGCTCTGC AGCACTGGAC CATCCGGGGC CCCGAGGGCA GCCTCGCCCA GCTAATTAGC 1440 

^CCCTGCGCC AGCACCGGAG AAACGATGTT GTGGAGAAGA TTCGTGGGCT GATGGAAGAC 1500 

ACCACCCAGC TGGAAACTGA CAAACTAGCT CTCCCGATGA GCCCCAGCCC GCTTAGCCCG 1560 

AGCCCCATCC CCAGCCCCAA CGCGAAACTT GAGAATTCCG CTCTCCTGAC GGTGGAGCCT 1620 

TCCCCACAGG ACAAGAACAA GGGCTTCTTC GTGGATGAGT CGGAGCCCCT TCTCCGCTGT 1680 

GACTCTACAT CCAGCGGCTC CTCCGCGCTG AGCAGGAACG GTTCCTTTAT TACCAAAGAA 1740 

AAGAAGGACA CAGTGTTGCG GCAGGTACGC CTGGACCCCT GTGACTTGCA GCCTATCTTT 1800 

GATGACATGC TCCACTTTCT AAATCCTGAG GAGCTGCGGG TGATTGAAGA GATTCCCCAG I860 

GCTGAGGACA AACTAGACCG GCTATTCGAA ATTATTGGAG TCAAGAGCCA GGAAGCCAGC 1920 
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CAGACOCTCC TGGACTCTGT TTATAGCCAT CTTCCTGACC TGCTGTAG 
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Seq ID NO: 76 Protein Sequence 
Protein Accession #: NP_055267.1 



MGTSPSSSTA 
TGQVLTCDKC 
LPCAALTDRE 
SSVMKCKAYT 
VPSSTYVPKG 
QQGPHHRHIL 
VLWIWCSI 
LVAAQVGSQW 
ALRQHRRNDV 
SPQDKNKGFP 
DDMLHFLNPE 



11 
1 

LASCSRIARR 
PAGTYVSEHC 
CTCPPGMFQS 
DCLSQNLWI 
MNSTESNSSA 
KLLPSMEATG 



KDIYQFLCNA 
VEKIRGLMED 
VDESEPLLRC 
ELRVIEEIPQ 



21 
I 

ATATMIAGSL 
TNTSLRVCSS 
NATCAPHTVC 
KPGTKETDNV 
SVRPKVLSSI 
GEKSSTPIKG 
PRQDPSAIVE 
SEREVAAFSN 
TTQLETDKIiA 
DSTSSGSSAL 
AEDKLDRIiFE 



31 
I 

LLLGFLSTTT 
CPVGTFTRHE 
PVGWGVRKKG 
CGTLPSFSSS 
QEGTVPDNTS 
PKRGHPRQNL 
KAGLKKSMTP 
GYTADHERAY 
LPMSPSPIiSP 
SRNGSFITKE 
IIGVKSQEAS 



41 

I 

AQPEQKASNL 
NGIEKCKDCS 
TETEDVRCKQ 
TSPSPGTAIF 
SARGKEDVNK 
HKHFDINEHL 
TQNREKWIYY 
AALQHWTIRG 
SPIPSPNAKL 
KKDTVLRQVR 
QTLLDSVYSH 



51 
I 

IGTYRHVDRA 
QPCPWPMIEK 
CARGTFSDVP 
PRPEHMETHE 
TLPNLQWKH 
PWMIVLFLLL 
CNGHGIDILK 
PEASLAQLIS 
ENSALLTVEP 
LDPCDLQPIF 
LPDliL 
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Seq ID NO: 77 DNA sequence 

Nucleic Acid Accession NM_003105.3 

Coding sequence: 123.. 6767 



GGCGCGGGCG 
TGGCCTCGCG 
ACATGGCGAC 
CACTGCTGCC 
CGCCCTTGCC 
TGTGGGCGOG 
AACGGAGCGC 
ATTCCCACAA 
TGGCCCGAGA 
ACTATGGAAA 
GTGAAGCTGT 
CAGACGCTTA 
TTTCCATCCC 
TGGGCTTTGA 
CCTGGATCAT 
AACCAAATAC 
GTACAGATTT 
TTCAGCTTCG 
AGCAGTCTTC 
TTGTCACAAG 
TTGTGTGTGT 
TGAAGTTCTC 
ACACCTTGGT 
TGCAAGGAGT 
CGGTCATCAC 
GATATGGAGA 
GCCTCAGTCA 
CTCCAGGCCT 
TGTACATCTC 
ACACATGGGG 
AGCTAAAATA 
CAGTGTTTGT 
TTGGCTCGAA 
CCTTGGGAGT 
GGAATGAGTG 
GCTTCAATGG 
AGGACTATGA 
TTCCAGATCC 
CTACTTACAG 
ATGTTGAAGC 
TCATTCTGTA 
AGCAGTTGCC 
ACTGTTTGTA 
GCACAGGGCA 
AACCCCTCAG 
ATCCAGATGG 
CTCTGGTCCT 
CTGGGATTTA 
TGAAGTGGCC 
TGGAGTGCAT 
TCCCGCACCC 
AGCTCAGCAT 
AGCTCACGGG 



11 
I 

GCCTGGAGCC 
CTGCACATTC 
ACGGAGCAGC 
GCCCGGAGCT 
CCAGGACCGG 
CGGGGATGCC 
TGCCCTGCAG 
TCAGATGGTG 
TAGCCTGGCA 
ATCATTCAAG 
TATCGCCCAG 
TGCCCAGTAC 
ATTTCGGGCA 
CAGGTCCCAC 
GATTCAGGAA 
CATCTACATT 
CTTCCAGTCC 
GGACAAGTAC 
TGTCCAGCTC 
ACATCCTATT 
CAGCCACAGT 
CCTGTCCTTG 
GAGGTATTTT 
CTACATTGCT 
CTTTGACAAA 
GAAAATCAAT 
GCTCCTCAAC 
CATCATCGCC 
TAGCAGTGCT 
AGACCACGGC 
CAGTACCAAT 
GTATGGCCTC 
CAAAGAGAAT 
TCCCTGCACA 
TTTGCTGGGA 
AGAGGACTTT 
GTGTGACTTC 
GGAATTTTCT 
GAGAACGAGA 
GCGACTGGAA 
TGCTGTGAGG 
TCTCACCGGG 
TTGGTCCGAC 
AGAGGTGATC 
CCAGCTGCTT 
CGACTTCCGA 
CGTGCCCCAA 
TCGGAGCAAT 
CAATGGCATC 
AGAGCGGATC 
CTATGCCATT 
ATTCCGAGCT 
GCTCATGGAC 



21 
I 

CCGGGAGCGG 
TCTCCTGGCG 
AGGAGGGAGT 
CTCTGCGAAG 
GGCTTCCTCG 
AGGGGGGCGA 
CCCGAGCCCA 
GTGCACTGGG 
TTGGCGAGGC 
AAAATTTCAG 
TTCTACCACA 
CTCTGGATCA 
GCTGATCTCC 
CCCAACAAGC 
CATGTCAAGT 
GAACGACACG 
CGGGAAAACC 
ATGTTTGCTA 
TGGGTCTCCT 
AATGAATATT 
AACAACCGCA 
GAGAACGTGC 
GCAAATGAAC 
ACTCTGATTA 
GGGGGAACCT 
TGTGAGCTTT 
CTCCAGCTCC 
ACTGGCTCAG 
GGAGOCAGGT 
GGAATCATCA 
GAAGGGGAGA 
CTCACAGAAC 
GTCCACAGCT 
GAGAATGACT 
CACAAGACTG 
GACAGGCCGG 
GGTTTCAAGA 
GGAAAGTCAT 
GGCTACCGGA 
GGAGAGCTGG 
AAATCCATCT 
CTACGGGCAG 
CTGGCCTTGG 
ATCAATTCTG 
TACTGGGTAG 
CTCACAATCG 
GAGGGGGTGA 
ATGGATGGTT 
TCTGTGGACG 
ACGTTCAGTG 
GCTGTCTTTA 
TCCAAATACA 
ATGAAGATTT 



31 



CGCGCGCGGT 
GCGGCGCCAC 
CGCGACTCCC 
TCTGGACGCA 
TGGTGCAGGG 
GCCGCGCGGA 
TCAAGGTGTA 
CTGGAGAGAA 
CCAAGAGCAG 
ACAAGTTAAA 
GCCCTGCGGA 
CGTTTGACTT 
TCCTACACAG 
AGCTGTGGAA 
CCTTTTCTTG 
AACCCTCTGG 
AGGAAGTGAT 
CAAAGGTGGT 
TTGGCCGGAA 
ACATCGCAGA 
CCAATTTATA 
TCTATTACAG 
CATTTGCTGA 
ATGGTTCTAT 
GGGAGTTTCT 
CCCAGGGCTG 
GGAGAATGCC 
TGGGAAAGAA 
GGCGAGAGGC 
CGGCCATTGC 
CCTGGAAAAC 
CTGGGGAGAA 
GGCTGATCCT 
ACAAGCTGTG 
TTTTCAAACG 
TGGTCGTGTC 
TGAGTGAAGA 
ACTCCCCTCC 
AGATTTCTGG 
TCCCCTGTCC 
ACCGCTATGA 
CAGTGGCCCT 
ACGTCATCCA 
GCCTGGAGAC 
ATGCAGGCTT 
TCAATTCCTC 
TGTTCTGGAC 
CTGCTGCCTA 
ACCAGTGGAT 
GCCAGCAGCG 
AGAATGAAAT 
GTGGGTCCCA 
TCTACAAGGG 



41 51 
I I 

CCCGGCCCAG CCGGCTCTCC 60 

CTGCAGTAGC GTTCGCCCGA 120 

GTTCCTATTC ACCCTGGTCG 180 

GAGGCTGCAC GGCGGCAGCG 240 

CGACCCGCGC GAGCTGCGGC 300 

CGAGAAGCCG CTCCGGAGGA 360 

CGGACAGGTT AGTCTGAATG 420 

AAGCAACGTG ATCGTGGCCT 480 

TGATGTGTAC GTGTCTTACG 540 

CTTTGGCTTG GGAAATAGGA 600 

CAACAAGCGG TACATCTTTG 660 

CTGCAACACT CTTCAAGGCT 720 

TAAGGCCTCC AACCTTCTCT 780 

GTCAGATGAC TTTGGCCAGA 840 

GGGAATTGAT CCCTATGACA 900 

CTACTCCACT GTCTTCCGAA 960 

CCTTGAGGAA GTGAGAGATT 1020 

GCATCTCTTG GGCAGTGAAC 1080 

GCCCATGAGA GCAGCCCAGT 1140 

TGCCTCCGAG GACCAGGTGT 1200 

CATCTCAGAG GCAGAGGGGC 1260 

CCCAGGAGGG GCCGGCAGTG 1320 
CTTCCACCGA GTGGAAGGAT 1380 

GAATGAGGAG AACATGAGAT 1440 
TCAGGCTCCA GCCTTCACGG 1500 
TTCCCTTCAT CTGGCTCAGC 1560 
CATCCTGTCC AAGGAGTCGG 1620 
CTTGGCTAGC AAGACAAACG 1680 
ACTTCCTGGA CCTCACTACT 1740 
CCAGGGCATG GAAACCAACG 1800 
ATTCATCTTC TCTGAGAAGC 1860 
GAGCACTGTC TTCACCATCT 1920 
CCAGGTCAAT GCCACGGATG 1980 
GTCACCATCT GATGAGCGGG 2040 
GCGGACCCCC CATGCCACAT 2100 
CAACTGCTCC TGCACCCGGG 2160 
TTTGTCATTA GAGGTTTGTG 2220 
TGTGCCTTGC CCTGTGGGTT 2280 
GGACACTTGT AGCGGAGGAG 2340 
CCTGGCAGAA GAGAACGAGT 2400 
CCTGGCCTCG GGAGCCACCG 2460 
GGACTTTGAC TATGAGCACA 2520 
GCGCCTCTGT TTGAATGGAA 2580 
AGTAGAAGCT TTGGCTTTTG 2640 
CAAAAAGATT GAGGTAGCTA 2700 
TGTGCTTGAT CGTCCCAGGG 2760 
AGACTGGGGA GACCTGAAGC 2820 
TCACCTGGTG TCTGAGGATG 2880 
TTACTGGACG GATGCCTACC 2940 
CTCTGTCATT CTGGACAACC 3000 
CTACTGGGAT GACTGGTCAC 3060 
GATGGAGATT CTGGCAAACC 3120 
GAAGAACACT GGAAGCAATG 3180 
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CCTGTGTGCC 
GCAGGTGTCC 
GCCCTCAGGG 
GCAACCAGTA 
ACAACGACTG 
ACACCCAGTT 
TTGAGGATGA 
GTGACGAGTA 
ACAACGACTG 
AGGCCTCCAA 
GGGATACGGA 
ATGGATTCCG 
GTGATTGCTC 
ACTTTGTGTG 
TCCAGTGCCG 
AGTTCCACAA 
TGATTTGGAA 
AAAACCCCAC 
ACTGCATCCC 
AGAAGGATTG 
TGCCCAATTA 
GGTACCGAGA 
CTGCTGCCTC 
AACCGAAGAC 
GCCGGGACGA 
AGTGCGAGGA 
GCTCGGACGA 
ATCTTCAGTG 
AAATGCCCTC 
GGAAGACTCT 
ATACCACGTA 
ACTTTGTGAC 
CACTCCCCAG 
ATGGCCTCAT 
CCCAGAGGGC 
CCGTCAGAGT 
TTACCACCAT 
AAAATTATCT 
TGAACCTTTT 
GAAGCATATT 
CCTGGGCCAA 
GGGTTCGCCC 
GTACCTGGAC 
ACCTCTATCG 
GTAAGGATGA 
CTGACTACGT 
TGGTTCATAC 
ACCAGGACTT 
ACAAAGTAAA 
GGAAATACCA 
CCACAGTTTC 
TTCTGTTTTG 
AGATACACAT 
ATTTCTTTAA 
GATGCCTTTT 
GGTCTGGTGC 
TGGTGCCCAT 
CGAAGCACCG 
GGCTGGGGTC 
CTATGATAAC 
ACTAGAAACC 
AAGATGCACT 
AAAAAA 



CAGGCCATGC 
AGAGGATGTG 
CTATCAGCTC 
TCGCTGCAGC 
TGGAGACATG 
TCGTTGCCAG 
CTGTGGAGAC 
CAACTGCAGT 
CAGGGACTGG 
CTTCCAGTGC 
CTGCCAGGAT 
CTGCCCAAAC 
TGATGGCTCC 
TAAGAACCGC 
CGACGGGTCC 
GGTATGTGAT 
GTGCGACGGG 
AGAAGCCCCA 
CAACAGATGG 
TGGAGATTCA 
CTACCGCTGC 
TTGTGCAGAT 
CACTCCCACC 
GTGTATTCCC 
GGCCAATTGC 
CGGGGAGGCC 
GAGCGATGAA 
GACAGCTGAC 
TGCTTCTTGT 
GGAGACCCAC 
TCAGGTTAAA 
CCTGAGGACC 
GGAAGCAGAA 
CCGTGAGTAC 
TGCTAGTAAC 
GGCTGCGGTG 
AAAAGGAAAA 
AAGCTTCACC 
CTGGGCATTT 
GTCACACAAA 
GACTGACTTG 
ACCTGCACCT 
CGGCCCCCGG 
CAACCCGAAG 
GCAGTATTTG 
TGTAGTGAAG 
GGGCAAAACC 
GTTGTATGCA 
ATCCCGTAAC 
CATCATTGTC 
ATTATCAGCA 
GAAAAGCCTG 
GTTTGATAGT 
AATTTCCAAC 
TGGCAACCAG 
AGATGCATCT 
CTTATTCCTG 
GAGGCTGCAG 
CGCAATCTTC 
TGGATTTTCA 
AAATGGTGTA 
TTGAGTTGCA 



AGCCTGCTGT 
TCCAGCAGTG 
AAGAACAATA 
AACGGGAACT 
AGCGATGAGA 
GAGTCTGGGA 
AACAGTGATG 
TCCGGCATGT 
TCTGATGAAG 
CGAAACGGGC 
GGTTCCGATG 
GGCACTTGCA 
GATGAACAGC 
CAGCAGTGCC 
GATGAGGATG 
GAGTTCGGTT 
ATGGATGATT 
AACTGCTCCC 
AAATGTGACA 
CATATTCTTC 
AGCAGTGGGA 
GGCTCTGACG 
CAACTTGGGC 
AACTGGAAGC 
CCCACACACA 
TGCATTGTGC 
AAGGCCTGCA 
TTCTCTGGGG 
GTATATAATG 
AGCAATAAGA 
GTACAGGTTC 
CCAGAGGGAT 
GGTGTGATTG 
ATTGTAGAAT 
TTTACAGAAA 
ACTAGTCGTG 
GTGATCCCAC 
CTGACCATGG 
GACACCCACA 
GTTGGCAATC 
GGGGATAGCC 
AGCCTCAAGG 
AATGTGGTTT 
AGCTTGACTA 
TTTCTGGTCC 
ATGATCCCGG 
TCCGTGGTCA 
ATTGCAGTCA 
AGCACTGTGG 
CAACTGGGGA 
CCTGATGCCT 
GCTTTAAAGG 
GCCATGAATA 
CTGAAGATGG 
ATCTGTGGGG 
GCAACGCAGG 
ATACTGCTGA 
AGCAGCTTCA 
TCCTCTGGGG 
GATGACGTCC 
AATATTTTAT 
ATATGTTATT 



GCCTGCCCAA 
TGCTTCCATC 
CCTGTGTCAA 
GTATCAACAG 
GAAACTGCCC 
CTTGTATCCC 
AAAGTCATTG 
GCATCCGCTC 
CCAACTGTAC 
ACTGCATCCC 
AGGATCCAGT 
TCCCATCCAG 
ACTGCGAGCC 
TGTTCCACTC 
CGGCGTTTGC 
TCCAGTGTCA 
GCGGCGATTA 
GCTACTTCCA 
GGGAGAACGA 
CCTTCTCGAC 
CCTGCGTGAT 
AGGAAGCCTG 
GATGTGACCG 
GCTGTGACGG 
GCACCTTGAC 
TCTCGGAGCG 
GTGATGAGTT 
ATGTGACTTT 
TCTACTACAG 
CAAACACTGT 
AGTGTCTCAG 
TGCCAGATGC 
TAGGCCACTG 
ACAGCAGGAG 
TCAAGAACTT 
GAATAGGAAA 
CACCAGATAT 
AGAGTGATAT 
AGCAAGAGAG 
TGACAGCTCA 
CTCTGGCATT 
CCAAAGCCAT 
ATGGTATTTT 
CTTCACTCCA 
GTGTAGTGGT 
ACAGCAGGCT 
TCAAGTGGGA 
AAGATCTCAT 
AATACACCCT 
ACATGAGCAA 
TAAAAATCAT 
AAAAGCATTT 
TCACAGCTTA 
GTCATAATTA 
AGCCTGCCAT 
CTGCCAGATC 
GCCTGGGGGT 
CCGCCTTCGC 
ATGACCTGGG 
CCATGGTGAT 
TTGATAAAGA 
TTTATATGGG 



GGCCAACAAC 
AGGGGACCTG 
AGAAGAGAAC 
CATTTGGTGG 
TACCACCATC 
ACTGTCCTAT 
TGAAATGCAC 
CTCCTGGGTA 
CGCCATCTAT 
CCAGCGGTGG 
CAACTGTGAG 
CAAACATTGT 
CCTCTGTACG 
CATGGTCTGT 
AGGATGCTCC 
GAATGGAGTG 
TTCTGATGAA 
GTTTCGGTGT 
CTGTGGGGAC 
TCCTGGGCCC 
GGACACCTGG 
CCCCTTGCTT 
ATTTGAGTTC 
CCACCAAGAT 
TTGCATGAGC 
CTGCGACGGC 
GACTGTGTAC 
GACCTGGATG 
GGTGGTTGGA 
ATTAAAAGTC 
CAAGGCACAC 
CCCTCGAAAT 
GGCTCCTCCC 
TGGTTCCAAG 
ATTGGTCAAC 
CTGGAGCGAT 
CCACATTGAC 
CAAGGTGAAT 
GAGAACTTTG 
TACATCCTAT 
TGAGCATGTT 
CAACCAGACT 
CTATGCCACG 
CAACAAGACG 
ACCCTACCAG 
TCCACCCCGT 
ATCACCGTAT 
AAGAAAGACT 
TAACAAGTTG 
AGATTCCAGC 
AACAGAAAAT 
TAATGAAAGC 
CCTTGGGAAT 
CACGTTCACC 
CCTGCTGTAC 
TACGGATGTT 
GGGGTTTGCC 
CAACAGCCAC 
GGAAGATGAT 
AGCCTGAAAG 
TAGTTGATGG 
CCAAAAACAA 



AGTAGAAGCT 
ATGTGTGACT 
ACCTGTCTTC 
TGTGACTTTG 
TGTGACCTGG 
AAATGTGACC 
CAGTGCCGGA 
TGTGACGGGG 
CACACCTGTG 
GCGTGTGACG 
AAGAAGTGCA 
GATGGTCTGC 
CACTTCATGG 
GACGGAATCA 
CAAGATCCTG 
TGCATCAGTT 
GCCAACTGCG 
GAGAATGGCC 
TGGTCTGATG 
TCCACGTGTC 
GTGTGCGACG 
GCAAACGTCA 
GAATGCCACC 
TGCCAGGATG 
AGGGAGTTCC 
TTCCTGGACT 
AAAGTACAGA 
AGGCCCAAAA 
GAGAGCATAT 
TTGAAACCAG 
AACACCAATG 
CTCCAGCTGT 
ATCCACACCC 
ATGTGGGCCT 
ACTCTATACA 
TCTAAATCCA 
AGCTATGGTG 
GGCTATGTGG 
AACTTCCGAG 
GAGATTTCTG 
ATGACCAGAG 
GCAGTGGAAT 
TCCTTTCTTG 
GTCATTGTCA 
GGGCCATCCT 
CACCTGCATG 
GACTCTCCTG 
GACAGGAGCT 
GAGCCTGGCG 
ATAAAAATTA 
GATCATGTTC 
AGGGGCTATG 
ACTACTGACA 
GTCCAAGCAA 
GATGAGCTGG 
GCTGCTGTGG 
ATCCTGTACA 
TACAGCTCCA 
GAAGATGCCC 
AGCTTTCCTC 
TTTATTTTAA 
AAAACAAAAA 



Seq ID NO: 78 Protein Sequence 
Protein Accession tt: NP_003096.1 
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MATRSSRRES 
WARGDARGAS 
ARDSLALARP 
DAYAQYLWIT 
WIMIQEHVKS 
QLRDKYMFAT 
VCVSHSNNRT 
QGVYIATUN 
LSQLLNLQLR 
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RLPFLFTLVA 
RADEKPLRRK 
KSSDVYVSYD 
FDPCNTLQGP 
FSWGIDPYDK 
KWHLLGSEQ 
NLYISBAEGI* 
GSMNEENMRS 
RMPILSKESA 
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LLPPGALCEV 
RSAALQPEPI 
YGKSFKKISD 
SIPFRAADLL 
PNTIYIERHE 
QSSVQLWVSF 
KFSLSLBNVL 
VITFDKGGTW 
PGLIIATGSV 



31 
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WTQRLHGGSA 
KVYGQVSLND 
KLNFGLGNRS 
LHSKASNLLL 
PSGYSTVFRS 
GRKPMRAAQF 
YYSPGGAGSD 
EFLQAPAFTG 
GKNLASKTNV 



41 
I 

PLPQDRGPLV 
SHNQMWHWA 
EAVIAQFYHS 
GFDRSHPNKQ 
TDFFQSREKQ 
VTRHPINEYY 
TLVRYFAHEP 
YGEKINCELS 
YISSSAGARW 



3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3B40 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 
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VQGDPRELRL 
GEKSNVIVAL 
PADNKRYIFA 
LWKSDDFGQT 
EVILEEVRDF 
IADASEDQVF 
PADFHRVEGL 
QGCSLHIAQR 
REALPGPHYY 



60 

120 

180 

240 

300 

360 

420 

480 

540 
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TWGDHGGI IT AIAQGMETNE LKYSTNEGET WKTFIFSEKP VFVYGLLTEP GEKSTVFTIF 600 

GSNKENVHSW LILQVNATDA LGVPCTENDY KLWSPSDERG NECLLGHKTV FKRRTPHATC 660 

FNGBDFDRPV WSNCSCTRE DYECDFGFKM SEDLSLEVCV PDPEFSGKSY SPPVPCFVGS 720 

TYRRTRGYRK ISGDTCSGGD VEARLEGELV PCPLAEENEF ILYAVRKSIY RYDLASGATE 780 

5 QLPLTGLRAA VALDFDYEHN CLYWSDLALD VIQRLCLNGS TGQEVIINSG LETVEALAFE 840 

PLSQIiLYWVD AGFKKIEVAN PDGDFRLTIV NSSVLDRPRA LVLVPQEGVM FWTDWGDLKP 900 

GIYRSNMDGS AAYHLVSEDV KWPKGISVDD QWIYWTDAYL ECIERITFSG QQRSVILDNL 960 

PHPYAIAVFK NEIYWDDWSQ LSIFRASKYS GSQMEILANQ LTGLMDMKIF YKGKNTGSNA 1020 

CVPRPCSLLC LPKANNSRSC RCPEDVSSSV LPSGDLMCDC PQGYQLKNNT CVKEENTCLR 1080 

10 NQYRCSNGNC INSIWWCDFD NDCGDMSDER NCPTTICDLD TQFRCQESGT CIPLSYKCDL 1140 

EDDOGDNSDE SHCEMHQCRS DEYNCSSGMC IRSSWVCDGD NDCRDWSDEA NCTAIYHTCE 1200 

ASNFQCRNGH CIPQRWACDG DTDCQDGSDE DPVNCEKKCN GFRCPNGTCI PSSKHCDGLR 1260 

DCSDGSDEQH CEPLCTHFMD FVCKNRQQCL FHSMVCDGII QCRDGSDEDA AFAGCSQDPE 1320 

FHKVCDEFGF QCQNGVCISL IWKCDGMDDC GDYSDEANCE NPTEAPNCSR YFQFRCENGH 1380 

15 CIPNRWKCDR ENDCGDWSDE KDCGDSHILP FSTPGPSTCL PNYYRCSSGT CVMDTWVCDG 1440 

YRDCADGSDE EACPLLANVT AASTPTQLGR CDRFEFECHQ PKTCIPNWKR CDGHQDCQDG 1500 

RDEANCPTHS TLTCMSREFQ CEDGEACIVL SERCDGFLDC SDESDEKACS DKLTVYKVQN 1560 

LQWTADFSGD VTLTWMRPKK MPSASCVYNV YYRWGESIW KTLETHSNKT NTVLKVLKPD 1620 

TTYQVKVQVQ CLSKAHNTND FVTLRTPEGL PDAPRNLQLS LPREAEGVIV GHWAPPIHTH 1680 

20 GLIREYIVEY SRSGSKMWAS QRAASNFTEI KNLLVNTLYT VRVAAVTSRG IGNWSDSKSI 1740 

TTIKGKVIPP PDIHIDSYGE NYLSFTLTME SDIKVNGYW NLFWAFDTHK QERRTLNFRG 1800 

SILSHKVGNL TAHTSYEISA WAKTDLGDSP LAFEHVMTRG VRPPAPSLKA KAINQTAVEC 1860 

TWTGPRNWY GIFVATSFLD LYRNPKSLTT SLHNKTVIVS KDEQYLFLVR WVPYQGPSS 1920 

DYVWKMIPD SRLPPRHLHV VHTGKTSWI KWESPYDSPD QDLLYAIAVK DLIRKTDRSY 1980 

25 KVKSRNSTVE YTLNKLEPGG KYHIIVQLGN MSKDSSIKIT TVSLSAPDAL KIITENDHVL 2040 

LFWKSLALKE KHFNESRGYE IHMFDSAMNI TAYLGNTTDN FFKISNLKMG HNYTFTVQAR 2100 

CLFGNQICGE PAILLYDELG SGADASATQA ARSTDVAAW VPILFLILLS LGVGFAILYT 2160 
KHRRLQSSFT AFANSHYSSR LGSAIFSSGD DLGEDDEDAP MITGFSDDVP MVIA 

30 Seq ID NO j 79 DNA sequence 

Nucleic Acid Accession fh Eos sequence 
Coding sequence: 12 0-755 

1 11 21 31 41 51 
35 ) | I I I I 



75 



CCTGCCACTC CCCTGCCAGT GTGCGAGGCA GCGTGAAGCT GGGGCCTGCT CCCCGCAGCC 60 

TCTGGAGCGC ATCTCAGACC TTCTGAGACC TATGTTGCTG GCCCCCCAGA ACCCGCAACA 120 

TGGCAGATGG GGCAAAGGCC AACCCCAAAG GGTTCAAAAA GAAGGTGCTG GATAGATGCT 180 

TCTCTGGGTG GAGGGGCCCA CGCTTCGGGG CCTCCTGTCC TTCAAGAACC TCCAGGTCTA 240 

40 GCCTGGGTAT GAAGAAGTTC TTCACCGTGG CCATCCTTGC TGGCAGCGTT CTGTCCACAG 300 

CICACGGCAG CCTGCTCAAC CTGAAGGCCA TGGTGGAGGC CGTCACAGGG AGGAGCGCCA 360 

TCCTGTCCTT CGTGGGCTAC GGTTGCTACT GTGGGCTGGG GGGCCGTGGC CAGCCCAAGG 420 

ATGAGGTGGA CTGGTGCTGC CACGCCCACG ACTGCTGCTA CCAGGAACTC TTTGACCAAG 480 

GCTGTCACCC CTATGTGGAC CACTATGATC ACACCATCGA GAACAACACT GAGATAGTCT 540 

45 GCAGTGACCT CAACAAGACA GAGTGTGACA AGCAGACATG CATGTGTGAC AAGAACATGG 600 

TTCTGTGCCT CATGAACCAG ACGTACCGAG AGGAGTACCG TGGCTTCCTC AATGTCTACT 660 

GCCAGGGCCC CACGCCCAAC TGCAGCATCT ATGAACCGCC CCCTGAGGAG GTCACCTGCA 720 

GTCACCAATC CCCAGCGCCC CCCGCCCCTC CCTAGAGCCT CTGAGGTTTG AGAGAGAGAG 780 

CGGGAGGAGG GTCTGGCTTG GGGACCAGAC GAGGTGCAGG GAGGGTAGGA GCCAGGCCAG 840 

50 GAGCCTGAGG GTTGCTGGTT GCCTCCTCCC TGGAGCTCTC CAGTGAGGGC TCAGCTCTCA 900 

GAGGACTCAG GAAGGCCTGG GTCCTGACTC CCCCAGCCCA GCCCCAGGCA TGGGTGCCTC 960 

CTCCTGCTGG TTCTGGACTG GGTGGGAGGC ACGGAGCTTA TAGGGGTCTC TCCTGAGGGT 1020 

GGCCGGGGAG ACCTGAGAGA GAGGAGGAGG GGCCTCTGAG TGGGGCCTCT GTTGCTGGCG 1080 

CCAGTTTAAC TCCCCGGAGC CTTAGAAAGT CTGAGCCTTA GAAAGTCTGG GCCTGAGCAT 1140 

55 ccS£g aSatgc ATCCTCGGCC CAAGATCACA GGAAGGCAGA TI^GTCA 1200 

CAAAGCATGG GTTCCGGGAG CCCCTCAGCT GATCCCACAG GATGGCCTGG GGTGGTGGCT 1260 

ACTTTGGGCT TGAAGCTCTC TAGAGCCCCA TTCACAGAGC AGGTGCCCCA CCCCAGCCCT 1320 

CACTGAGGTG GCCATGAGTC CAGGTTCAGA CCCTAGGGGA TGTGCAGTGC TGAGTGTCCC 1380 

AGCCCTGAAG TCTAGCCTCA GAGCAGGCTC CTGGGCACTG ACAGAGCCAC ACACCCAGGT 1440 

60 ACACCCCCCA GGCTAATATG GGGACACACA AAAATATGGC CTTCAACACC TACAAGTCTT 1500 

CTCTGTGTCC CCCATCGACC CATGCACATC ACCCCAACCC TGCTCTGCCA CATCACGTGG 1560 

GTCTGAGGAG CACTCCTACC CCCCGGCTCC GGGTCCCTGA CAGACACTGC CCTCCTGGCC 1620 

TGCACCTGGA ATGGCAGCCA GAACACTGGA GGGGGACCCA CACCCTCTCT CCCTGGGCTG 1680 

« T^C^AGTCC CATCCCTCCT GGCCTGCACC TGGAACAGCA C^GAACAC 1740 

65 CGGAGGgSa CCCACACCCT CTCTCCCTGG GCTGTGGGCA GGCGTCACAA GTCCCATTGG 1800 

« GCTGAGGGCT GAGACCCACC TCAGTGAAGA GGGAAACGTA AGCAGGGAGA 1860 

GCATCAAGGC GGGAGGCAGC CTGGAAACTT CTAGAAGAGG GTCAAGGTAG GAGGCTACAT 1920 

qqAAGCTTCT AGAAGGCAGT AAGGAACAGG GTGGTGAAGG AGAGGGGAGA AGATGGGAGC 1980 

ATGGAGAGAC GAGGTGAGTG ACAGCCACCA GCAGGGACGG GCCTCCAGGC AACACAGAGG 2040 

70 AGCTGGGGGC CaSgCCTTA CGGGGCTGGG TTCCTGTGTG GGGCGTGGAG CWtfGGCTO* 2100 

MTGGAGGGG ACGGCCCTGC GGCCCCCAGC ACTGTTGTTT CTCCAGGTGC ACCTTACCTT 2160 

cctc^tta^c tgcccgagcc TCAACTCAGG GTCAGCAGCC tccacggagc CAGCCCCACC 2220 

TGCCCGAAAA CCACTGAAGC CACCCTCCGC CCGACGTCAA CGGGGATCCA CTAGTTTAGA 2280 



GCGCGGCCCC CGCGTGC 

Seq ID NO: 80 Protein sequence: 
Protein Accession &: Eos sequence 
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1 11 21 31 41 51 

I I I I I I 

MADGAKANPK GFKKKVLDRC FSGWRGPRFG ASCPSRTSRS SLGMKKFPTV AILAGSVLST 60 
AHGSLLNLKA MVEAVTGRSA ILSFVGYGCY CGLGGRGQPK DEVDWCCHAH DCCYQELPDQ 120 
GCHPYVDHYD HTIENNTEIV CSDLNKTECD KQTCMCDKNM VLCLMNQTYR EEYRGFLNVY 180 
CQGPTPNCSI YEPPPEEVTC SHQSPAPPAP P 

Seq ID NO: 81 DNA sequence 

Nucleic Acid Accession gi NM_018136.1 

Coding sequence: 3 8.. 2218 



GAAATCTGTA 
AAATGTATCA 
AAGCAAAACT 
AGCTTCTGGC 
AACAATTCAA 
TGCCCTACGG 
GAAAAGAGCT 
TTTACTATAT 
AAAACATCAA 
AAGTAAAGGA 
TCAGGCTATG 
CAAGATTCAA 
AGCTGTTAAA 
TGTGAGAGCA 
AGCTCATGAA 
TTATAGAGGA 
ACAAAAATAT 
TAAAAAATCT 
TTTAGAACAG 
GAATGCTGTT 
GCAGGTTAAT 
ATTTATTCAG 
CCAGCGAAAT 
AAAGCAGGAA 
TTGGAGGAAG 
TAATAGGGAG 
CCTTTTGACA 
TACTAGATTG 
ATTTGTTTTG 
TGTGCAAGTC 
AGAAAATTGT 
TAAAGTTGCA 
ACTGAAGACA 
TTACAGTCTC 
CAAGCAAAAG 
CAGAATAGTT 
CACAAATCCC 
AATGTAAACA 
AAAGTGATTT 
AAACTGTTCA 
TATGTCATAA 



11 
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CGAACTATTC 
GAGGAAAAGA 
CAGTATGAAG 
CTTGCTTGTT 
AAAGCTTTTT 
ATTCAGTTCT 
GCTATCACTT 
AGAAAAGCAG 
AGACAAGTCT 
TTTATACAGA 
TGGAGGAGAT 
GCCTGGTATA 
ATTATTCAAG 
TCAGCAATTA 
CACTTCTTAA 
TATAAAGGAA 
ATACGAGCCA 
ACAGTTATCC 
AGAGCCAAAA 
AGAATTCAAA 
TCAGTCATCT 
AAATATCATA 
AGGGCTGCAT 
AAATTCACTA 
AAAAATGATT 
ATTCGAGAAG 
TATAAGCACC 
TCTCCACTTT 
ATCCGAAGTT 
TTGCTTAATG 
ATAGATATAC 
GACAAAGGCG 
ACAAATAGAG 
TACAAACTTA 
AAGAATTCTT 
TCAAGACTTA 
CTGCAAGCTA 
TTTTCAGTAT 
TTGCTCTCTG 
TTACATTCTT 
GATG 



21 
I 

AGGCTGCTTT 
TGGCAGCCAT 
CTGTTCAAAG 
CACAGGAAGC 
GTAGAATGGT 
TCCTTCAGAT 
TACAGCATTA 
CAGTGGTTTT 
ATTTACAGAT 
AACGGAAGTT 
ATAGAGCCAA 
GATGTTGGAG 
GTTGCTTCTA 
TCATTCAGAG 
TGATAAAAAG 
GGCAGGTCTC 
GGGAGGCTGG 
TACAAGCACT 
TTCGACTTCT 
GAGCCTATAA 
GTATTCAGAG 
GCATCAAAAA 
CAGTAATACA 
GTGGAATCAT 
GTACAAAAAT 
AAAACAAACT 
TTTCTGCCAT 
GTTGTGAGAA 
GTAATCGCAG 
TATCTAAGTA 
TATTGGAGCT 
GAAGCATTTT 
CCTCTGATGT 
CAGCTCATAA 
CTATAAGCAT 
AGCCAGATTG 
TTCAAATGGT 
GTATAGTGTA 
TGTACAACTT 
CATTTTTATC 



31 
I 

TAGAGGCATG 
. TGTTAACCAA 
TGAAGGTGTT 
AGAGTATCAT 
CACAAGAAAA 
GGCTGTGTAT 
TTTTAGGACG 
ACAAAATCAC 
CAGAAGCAGT 
TCAGGAAATT 
GAAATATTTA 
AGCACACAAA 
TACCAAACTA 
AAAATGGAGA 
ACATCGAGCT 
TCTTCGGCAG 
AAAGCATGAA 
GGTGCGTGGT 
TCACTTCACT 
ACTTTACCTG 
ATGGTTTCGA 
GATTGAGCAT 
GAAAGCAGTG 
TAAAATTCAG 
TAAAGCTATA 
CTACAAAAGA 
TCTTGAGGCC 
CATGGCCCAG 
TATTCCTTGT 
TGAGAAAACT 
TTTGCAGATA 
TACAAAAACT 
ACGAAGTAGG 
ACATAAAATG 
TCCTTTTATC 
GGTTTTGAGA 
GATGGATACG 
AAGAAATATT 
TTAAAATCTG 
ATTTATAGTT 



41 



AAAGTTAGAC 
TCTGCACTCT 
ATGATTCAAG 
TCTCAAAGTA 
CTGGAAACAC 
CGGAGAAGAT 
TGGCAAACCA 
TACAGAGCAT 
GTTATCATTA 
AAAAATAGCA 
TGTAAAGTGA 
GAATATCTAG 
GAGAGAACAC 
GCTATACTTC 
GCTTGTTTGA 
AAATCTGCTG 
AGGATAAAAT 
TGGCTAGTAC 
GCAGCTGCAT 
GCTGTGAAGA 
GCAAGATTAC 
GAAGGTCAAG 
CGCCATTTTC 
GCATTATGGA 
CGACTAAGTC 
ACTGCACTTG 
TTAAAACACC 
AGTGGAGCAA 
ATGGAAGTCA 
ACTTCAGCAG 
TACCGAGAAA 
TGTTGTTTGT 
TCCAAAGTTG 
AATACTGAAA 
CCAGAAACAC 
AGAGATAACA 
CTTGGCATTC 
AAAGCCAATC 
ACTTTGTTTT 
TTATGCATGT 



51 
1 

AAAAATTGAA 
GCTGTTACAG 
AGTGGTATAA 
GGGCTGCAGT 
AGAAATGTGC 
TTGTTCAGCA 
GAAAACAGTT 
TTCTGTCTGC 
TTCAAGCTAG 
CCATAAAAAT 
AAGCTGCCTG 
CTGTATTAAA 
GGTTTTTGAA 
CTGCAAAGAT 
TCCAAGCACA 
CTTTGATCAT 
ATATTGAATT 
GAAAAAGATT 
ATTATCACCT 
ATGCTAACAA 
AAGAAAAGAG 
AATGTCTGAG 
TCCTCCGTAA 
GAGGCTATTC 
TTCAAGTTGT 
CACTTCATTA 
TAGAGGTAGT 
TTTCTAAAAT 
TCAGATATGC 
TTTATGATGT 
AGCCTGGTAA 
TGGCTATTTT 
TTGACCGTAT 
GAATACTTTA 
CTGTAAGGAC 
TGGAAGAAAT 
CTTATTAGTA 
ATGAGTACGT 
AAAAAAACAT 
AATAAACTAA 



Seq ID NO: 82 Protein Sequence 
Protein Accession ft: NP_060606.1 



1 
I 

MKVRQKLKNV 
HSQSRAAVTI 
TWQTRKQPLL 
IKNSTIKIQA 
LERTRPLNVR 
QKSAAL.IIQK 
TAAAYYHLNA 
HEGQECLSQR 
IRLSLQWNR 
QSGAISKIFV 
IYREKPGNKV 
MNTERILYXQ 
TLGIPY 



11 
I 

SEEKMAAIVN 
QKAFCRMVTR 
YRKAAWLQN 
MWRRYRAKKY 
ASAIIIQRKW 
YIRAREAGKH 
VRIQRAYKLY 
NRAASVIQKA 
EIREENKLYK 
LIRSCNRSIP 
ADKGGSIFTK 
KKNSSISIPF 



21 
I 

QSALCCYRSK 
KLETQKCAAL 
HYRAFLSAKH 
LCKVKAACKI 
RAILPAKIAH 
ERIKYIEFKK 
LAVKNANKQV 
VRHPLLRKKQ 
RTALALHYLL 
CMEVIRYAVQ 
TCCLLAILLK 
IPETPVRTRI 



31 
I 

TQYEAVQSEG 
RIQFFLQMAV 
QRQVYLQIRS 
QAWYRCWRAH 
EHFLMIKRHR 
STVILQALVR 
NSVICIQRWF 
EKFTSGIIKI 
TYKHLSAILE 
VLLNVSKYEK 
TTNRASDVRS 
VSRLKPDWVXi 



41 
I 

VMIQEWYKAS 
YRRRFVQQKR 
SVIIIQARSK 
KEYIAVLKAV 
AACIjIQAHYR 
GWLVRKRFLE 
RARLQEKRFI 
QALWRGYSWR 
ALKHLEWTR 
TTSAVYDVEN 
RSKWDRIYS 
RRDNMEEITN 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 



51 
I 

GLACSQEAEY 
AAITLQHYFR 
GPIQKRKFQE 
KIIQGCFYTK 
GYKGRQVSLR 
QRAKIRLLHF 
QKYHSIKKIE 
KKNDCTKIKA 
LSPLCCENMA 
CIDILLELLQ 
LYKLTAHKHK 
PLQAIQMVMD 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 



Seq ID NO: 83 DNA sequence 

Nucleic Acid Accession ft: NM_000612.2 

Coding sequence: 5S3..1095 



261 



WO 03/003906 



PCT/US02/21338 



1 11 21 31 41 51 

I I I I I 1 

TTCTCCCGCA ACCTTCCCTT CGCTCCCTCC CGTCCCCCCC AGCTCCTAGC CTCOGACTCC 60 

CTCCCCCCCT CACGCCCGCC CTCTCGCCTT CGCCGAACCA AAGTGGATTA ATTACACGCT 120 

5 TTCTGTTTCT CTCCGTGCTG TTCTCTCCCG CTGTGCGCCT GCCCGCCTCT CGCTGTCCTC 1B0 

TCTCCCCCTC GCCCTCTCTT CGGCCCCCCC CTTTCACGTT CACTCTGTCT CTCCCACTAT 240 

CTCTGCCCCC CTCTATCCTT GATACAACAG CTGACCTCAT TTCCCGATAC CTTTTCCCCC 300 

CCGAAAAGTA CAACATCTGG CCCGCCCCAG CCCGAAGACA GCCCGTCCTC CCTGGACAAT 360 

CAGACGAATT CTCCCCCCCC CCCCAAAAAA AAAAGCCATC CCCCCGCTCT GCCCCGTCGC 420 

10 ACATTCGGCC CCCGCGACTC GGCCAGAGCG GCGCTGGCAG AGGAGTGTCC GGCAGGAGGG 480 

CCAACGCCCG CTGTTCGGTT TGCGACACGC AGCAGGGAGG TGGGCGGCAG CGTCGCCGGC 540 

TTCCAGACAC CAATGGGAAT CCCAATGGGG AAGTCGATGC TGGTGCTTCT CACCTTCTTG 600 

GCCTTCGCCT CGTGCTGCAT TGCTGCTTAC CGCCCCAGTG AGACCCTGTG CGGCGGGGAG 660 

CTGGTGGACA CCCTCCAGTT CGTCTGTGGG GACCGCGGCT TCTACTTCAG CAGGCCCGCA 720 

15 AGCCGTGTGA GCCGTCGCAG CCGTGGCATC GTTGAGGAGT GCTGTTTCCG CAGCTGTGAC 780 

CTGGCCCTCC TGGAGACGTA CTGTGCTACC CCCGCCAAGT CCGAGAGGGA CGTGTCGACC 840 

CCTCCGACCG TGCTTCCGGA CAACTTCCCC AGATACCCCG TGGGCAAGTT CTTCCAATAT 900 

GACACCTGGA AGCAGTCCAC CCAGCGCCTG CGCAGGGGCC TGCCTGCCCT CCTGCGTGCC 960 

CGCCGGGGTC ACGTGCTCGC CAAGGAGCTC GAGGCGTTCA GGGAGGCCAA ACGTCACCGT 1020 

20 CCCCTGATTG CTCTACCCAC CCAAGACCCC GCCCACGGGG GCGCCCCCCC AGAGATGGCC 1080 

AGCAATCGGA AGTGAGCAAA ACTGCCGCAA GTCTGCAGCC CGGCGCCACC ATCCTGCAGC 1140 

CTCCTCCTGA CCACGGACGT TTCCATCAGG TTCCATCCCG AAAATCTCTC GGTTCCACGT 1200 

CCCCCTGGGG CTTCTCCTGA CCCAGTCCCC GTGCCCCGCC TCCCCGAAAC AGGCTACTCT 1260 

CCTCGGCCCC CTCCATCGGG CTGAGGAAGC ACAGCAGCAT CTTCAAACAT GTACAAAATC 1320 

25 GATTGGCTTT AAACACCCTT CACATACCCT CCCCCC 

Seq ID NO: 84 Protein Sequence 
Protein Accession #: NP_000603.1 

30 1 11 21 31 41 51 

MGIPWGKSML VLLTFLAFAS CCIAAYRPSE TLCGGELVDT LQFVCGDRGP YFSRPASRVS 60 
RRSRGIVEBC CFRSCDLALL ETYCATPAKS BRDVSTPPTV LPDNFPRYPV GKFFQYDTWK 120 
QSTQRLRRGL PALLRARRGH VLAKELEAFR EAKRHRPLIA LPTQDPAHGG APPEMASNRK 180 



35 
40 



Seq ID NO: 85 DNA sequence 

Nucleic Acid Accession #: XM_092643.1 

Coding sequence: 1-5352 



11 21 31 41 51 

ATGACTGAAG CCATTACAGA AGCAGCAGTA GCCTCAAGTG AGGAGGTGTC AGAGGAAAGA 60 

GACGACCTGG GGCCTCTTGA ATTACATGAC AGTGGTACCT TTCAACAAGT CGTGAACCTT 120 

45 CTGGACATCA TTGACAGCGA GTCAGCAAAG ACGGACACAA CAGGGGCAGG CCTTGACATG 180 

CGGAAGACCC TGGCCTCGGT GATAATCATG GAGAAGGCCA CCACTGAGCC TTCTGTAGTG 240 

ATAAACACTC TCATCCGCTG CCTGCAGGTG CCAGAGATTT CCACCCAGCG CAAGGTCAAC 300 

ATTTACAACA TCCTCCAGGA CATCATCCAG CAGGAGGGGG AGCTGGAGGA GCAGTGCGTG 360 

CAGAGGCTGG TGGCCATTGC CTCCAAGGAG ATGAGGGAGA TCCCAGAGAT GGAGGGCTAT 420 

50 ATGAAGGCAG AGGTGGCCAG CGACACACTG GTGGCTCTGT CCCGAAACCA CTTCAGCTTG 480 

GTCATGTACG AGCTGCAGCA CCACCTCAAG CCCCTCAACC TCACTGATGA ATTTGTCATC 540 

ATCACACTGG CCAAGCTGGC CAACGGCAAT GTGTTTGAGT TCATGCCATA CATGGGCATC 600 

ACCCTGGCTA CCATATTCAC CATGCTGAGA CTTGCCAATG AAGCCAAGAT ACGCCAGGCG 660 

ATCTGCAGTG CCATGGAGAC CTTCTGTGAG ACGGTGCAGT TTTATCTGAA GCACCTGGAG 720 

55 GAGAGCGTGT ACCCCGTGAT GACTGAGGAG GAGTTTGCCC TGAAGGTGTT CCCCATGTAT 780 

CGCTACTTCG TGACAGTGTG GCTGAGGCAC TACAACCCCG AGGTGAAGCT GGGGGTGATC B40 

AAGTCCCTGA AGCCCATGCT CGGCCTCCTT CTGCCCAACG ATGACCTGCG GGAGCAGGTC 900 

TACGACTACA TCCCCCTGCT GCTGGCGGAG TACCAGGGCA GTCTGGAGGT CTTGAGGCAG 960 

ATCCTGGAAC TGTCAGTCAC CACCAACACC CCTGTCCCCC AAATGCAGCT ACACACCATT 1020 

60 TTCACAGAAC TGCACGTCCA GGTGTGCAAC AAGGCCCCGG CCCAGCATCA GTACAGCAGC 1080 

CAGAATCTGA TGGAGATGGT GCACTGCTTC GTAGCCCTTG CTCGCTCCTA CCCCAAGGAG 1140 

CTGATGAAGT TCTTCTTCAG CCAGATGGAG ACAAACAAGG AGGCCGTCCG CGTGGGGACT 1200 

CTGAATCTGA TTAGGGCTAT AGTGAGCGCA GATGAGCCCA GGATGAGTAT CAGGGCCATC 1260 

TACCTGGCTA TCCGGGTAGT CAAGAACACC ATCTCTGATA CCCGGTCCAA GGTGAGGATG 1320 

65 GCTATTCTCC ACATCATTGG GCAGTTGGCT CTCTGTGGCT ACCAGGAGAG AATCAAAGGC 1380 

TGGGGCCTGA AGTACCTCTC TGTGCAGCTG ACCTTATCCA CCTACAAACT GACAAATCGC 1440 

CGGGAGAAGT TTTATCAGAG GGACTTGGAG GAGAGGATGG TCCAC AAAGT CACCATGGAC 1500 

ACTGTGAAGA TCATTACCTC TTCTGTCAGT GGGATGACCA CCGAGTTTTG GGTGAGGCTG 1560 

CTGTGCTACA TCATGGAGAC AGACTACGTG GAAGCTTTGA CTCCTATCTG TATCAGCCTC 1620 

70 ACAAACCTGG CAGAACACCA GCTCCATGGC CAGGATGTGG ATGTCAGCGT GGCTGGCAAG 1680 

AGCAGGCAAG TGGACCTGCC TGCACCTCAG AAGCTGCTGG CCCGTCTCCT GGTGCTGATG 1740 

TCATCACCTT ACAAGGGGGA GGGTCGTGGG ATAGCCATGC TCAACCTCTT GAGGACCCTG 1800 

AGCCAGAGCA TCGCACCCTC CATGGCCGAC ATGTGGGAGC TGGAGATTGC GCTACTGGTC 1860 

CGGTACCTGG AAGAACATAC TGAGTTCACT TGGGATCAGA AAGCCTGGGA AGACAAGCTG 1920 

75 ATTCAGTTTC TGCGAAACTC CCTCAAGAAG ACCCGGGGGT CTAGCTGGAG CCTGCGCTTG 1980 

AGTAAAGAGC TGAACAACCA GATTGCGAGC TTTGACAGCC CCTCTCTGGA GAAGGGCTTT 2040 

CTGTACCGGG CCTTGGGCTT CACCTTGGCC ACAGGCCTGG AGGCCAGCAA GGTGGAGGTC 2100 

CTGCTGTTGG AGCTGCTGTA CAAGACGGAC TACAGCAATG ACTTTGACAG CGAGGGTGTG 2160 

262 



WO 03/003906 



PCT/US02/21338 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 



ATTATGTGCT 
GACTTCGAGG 
GACCATCCCT 
GCCTCCTACT 
ATCATTCACC 
AGTGTTGTGC 
GCCCAGAAGA 
AACCTGGTTT 
AAGCCTTTCT 
ATTTCTCTCC 
CTGAGCTCCC 
CTGCAGGAGA 
GAAAAGGCCG 
TGTATCCACC 
TGTGATGCCC 
CTTCACGCAA 
TGTAAGGGGG 
GCCAAGGTGG 
TGCGAGAACA 
TTCTTCCTCC 
ATCCTGGTGC 
ATCCTGCTGC 
CTGCCCATGG 
GCCCGGACCA 
AGTGCCACCT 
TGCACCATCC 
CCTGACCTCA 
CCGCCGGTCT 
CTGCAAAGGT 
GTCAAGAGCC 
GACGGCGGGA 
GAGGGCCACA 
AGCTGCCGCA 
CCAGTTCTGT 
GACCAGGAGG 
CTGGGCGCCC 
CCCCTGAGGG 
ATCCTGGCTG 
TGCAGGATCT 
GGAAAGCTGG 
AAGGCCTGGA 
AGATACATCC 
CTTGGGTCTG 
TGGAAGTCCC 
GTTTTCCAGA 
TTCTTGCTAG 
TGCAACTTGG 
TTCCCAGCAT 
GGCCCAAAGG 
GACAGCGTGC 
TGTTCCCAGT 
TGCTATGCTG 
CTGTTTTGTG 
AAACAAGCTT 



TTGGCCTGTG 
AGAGGATCCA 
GGAGGCGGGA 
GCCACCCCCA 
ATTATGTCAG 
AGGTTACCAA 
CGACTCTTAC 
CTCCAGTGCG 
ACTCCACAGA 
AACTCCCAGG 
TGGAGCAGCT 
TGGTGCAGCT 
TGAGCCTCCA 
TAAAGCTGGG 
ATCAAAGAAC 
GCCAGACCTG 
ACCTCCAGAG 
TCTGCATGGA 
CTGGGGCCAT 
AGATGCGGGC 
ACCTGCCGGT 
TGGCGCACCA 
AGAGCCACCT 
TGCTCCACAG 
CCAAGGCTGA 
ACCTTCTCAT 
TCTACACCCT 
TGAAGATGTG 
GCTCTCGAGG 
AGCACCTGGC 
CATTCCTGGA 
GGCAGAGGCT 
TCAGCAGCAC 
ACCAGGAGAA 
AAGACGAGGC 
CCAAGAAGGT 
AGCCCGTGAG 
AGCTCCGGGA 
TCTTCGACAA 
CAAGGGTGGT 
TCCCCCTCAT 
TGGGCTTTGT 
CTGGCTCGGC 
TGGAGCATCC 
CAACCATGTG 
AAACAATGGC 
CAGGAATTAT 
TACGGAATTC 
GTCGGAAACT 
CTAAGGCCAG 
CCAACTGCCA 
TCCAAAATCC 
GAGCAAATGG 
GA 



TGCCCGGGGC 
GGAGTCAGAG 
GACAGTGAAA 
GTTGCTCCTC 
CAGCTGCCAG 
GGCCATCAAC 
CAGCATTATA 
AGCCTTGGCG 
GGAAAACAGT 
AGAGGACAAT 
GATGGAGAGC 
CCTGGAAAAG 
TCTCTATCTC 
GCAGTTTGGC 
CCGCATGGCC 
CTCCTTGTGG 
CACAGATGTG 
GTTTAGCTGC 
GAACCTGCAG 
CAAGGAGCTG 
GGTGGACCAC 
CCACCAGGAG 
GGCAGAGGTG 
CCTGATGGGC 
CATCTGGCGC 
TCAGAAGCTG 
CCTGCTGCAG 
GAAGCTGGTC 
GCGGGTCACT 
ACATACCCTG 
GGGTGTGAGC 
GGCCGAGCTG 
AGCGGTCTGC 
GCTGCTGAAG 
CCTGCGGGTG 
GAAGCAGTAC 
CAACAGCGTG 
AGGGGATGTG 
CGAGAGCGAG 
CGGGATGTCC 
GCTGCACTCC 
GTCCCAGTCT 
CTGTATGGCT 
CTCAGGGCCA 
CTCCATCCTG 
CTATGTTAAA 
TATGAAGCAG 
CCTCCAGGAA 
AGCTTTGTGT 
GCACTGTGCC 
AAGGAATTCA 
AACCCGTGCT 
TGACTTGATT 



CAGGTAAAAA 
CAGTCCTGGC 
AGTGCCCTCA 
AACCTOGTGG 
GACATCTGTC 
AACATCAAGG 
GTGGCGGTCA 
ATGGAGGCCC 
GAGCTGATGG 
GAGTCCATTA 
CTCCTGCAGA 
TGGATCTTGT 
ATGTGGATTT 
ACAATGGTCG 
TCAATGAATG 
GGCCCTTCCA 
GAGAAGATCT 
GATGAGGTGG 
CATGACAAGG 
GAGGACAAGG 
CCAGAGGTGC 
ACCATCCTCA 
TGGCTGGCAG 
CGGCTGCAGT 
CTGGCTGCGG 
GATGAGAATG 
CTTGGAAGCA 
CACACCACTC 
ATCAAGTCCA 
GACGAGCAGG 
CTGCTGGCCA 
GTGCTCAGGG 
GTGGAAATGA 
CCGGCAGCTT 
CTGTCCCTGC 
CGGAAGGTCT 
ACTGCCGAGG 
GGGTCCTCTT 
CTGCTGCGTC 
AAGAAGCATT 
CAGGACCCCT 
GGGGCCCGCT 
ACCATGTTTC 
AGTGATACCG 
ACTCGGAAAA 
AATAACTTGT 
ATGTCTACAC 
CTACAGCTGG 
CTTCACTCGG 
AAGCGAGCAG 
CCATCCGGGG 
AAAGCCTTCT 
TCCTGCCCAG 



CGGTGCTGAA 
AGATCAGTGC 
TGGTGATGTA 
ACAGCCCCAT 
TCAAAATGGC 
ACCTGGAGGA 
TCAAGGCAGA 
TCTCGCACCT 
ATATCAGCAT 
AGACCCTGTA 
GGCAGCTGGA 
CGGAGAAAGA 
ATGTCCACAG 
GACTCATTGC 
TCCTGTCCAG 
AGCAGAAGGA 
TCTGTGCATC 
TCTCGCTCAT 
CCTCTGTCAC 
TGGCCGAGAT 
GGCGCCTTCT 
CATCGCTCCT 
TGTCGGAGAA 
CACGGCTCAG 
TGGACCCCCT 
ACAAGCTCCC 
GCCACCGACC 
CTCTGCCGGA 
TGCAGCTCTT 
CAGTGTGGGA 
GGCTGTGCAT 
GCATGGACTC 
GGCACCGGTT 
TGCTGCTGGA 
GCGCCCTCGG 
TGCTGGAGAA 
GCATGGAGGC 
TCGACGCCAT 
TGAAAGCCTT 
TCTTCAAAGG 
GCTCCAATGC 
GTTCCCCGAG 
AGTGTGTGCA 
CTACTGATGA 
AGCCGGCTGT 
CAAGAATCAG 
ATTATCTGAA 
ACCCGGATCC 
AGTTCATCCA 
GTAAACCAGG 
TCCCATCAAC 
CTGGAACAGT 
TGATAAAGAG 



TGTGCTTCAT 
TTGGCGGAAG 
TAGCTGCGTG 
CACCGCTAAG 
CTTCATGAAG 
CTTTCACTTT 
ACCGACTGAC 
GAGCAAGCTG 
ACATTCTGTA 
TGCAAATGCC 
CCCCAAGGGG 
ATGGGAGCGG 
CACTGCTGTC 
CCCGTGCACC 
CCTGCTAGAT 
GCTTGAGAAA 
CTCCAGAATC 
CCAGAAGCTC 
CTGGATAGCC 
CCTGAGTGCC 
CATTGACGGC 
GAGGCAGCCA 
CGTGCCCTTC 
CCCCAGAATC 
GATGACCCTG 
GGACTTCCTC 
AGAGGCCGCC 
GGAGATGAAC 
GTTCAAGAGA 
CCTCCTGCAG 
GCAGCACGTG 
AGAAGTCCTG 
CATGAGCGGC 
GAAGGGTGCC 
CAACATGGCC 
GTGCCTGGGC 
CCTGACCAAG 
GTCTGAGCAG 
CATCCTCTTT 
GGAGGTGAAG 
AGCCCAAGTA 
GAAACAGGTC 
CTTCTGGGGC 
CAAGATGACC 
TCTCTACCGC 
AATCGCTGCT 
AAAGCTGGAC 
CGGGCCCACT 
CAOGCATGCT 
ACTTCCAAAC 
AGGAGAGAGC 
GGTCTTCAAA 
TATTGCTGCT 



2220 

22B0 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2 940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 



Seq ID NO: 86 Protein sequence* 
Protein Accession »: XP_092643.1 



l 
I 

MTEAITEAAV 
RKTLASVIIM 
QRLVAIASKE 
ITLAKLANGN 
ESVYPVMTEE 
YDYIPLLLiAE 
QNLMEMVHCF 
YLAIRWKNT 
REKFYQRDLE 
TNLAEHQLHQ 
SQSIAPSMAD 
SKELNNQIAS 
IMCFGLCARG 
ASYCHPQLLL 
AQKTTLTSII 
ISLQLPGEDN 
EKAVSLHLYL 
LHASQTCSLW 



11 
I 

ASSEEVSEER 
EKATTEPSW 
MREIPEMEGY 
VFEFMPYMGI 
EFALKVFPMY 
YQGSLEVLRQ 
VALARSYPKE 
ISDTRSKVRM 
ERMVHKVTMD 
QDVDVSVAGK 
MWELBIAUiV 
FDSPSLEKGF 
QVKTVLNVLH 
NLVDSPITAK 
VAVIKAEPTD 
ESIKTLYANA 
MWIYVHSTAV 
GPSKQKELEK 



21 
I 

DDLGPLELHD 
INTLIRCLQV 
MKAEVASDTL 
TLATIFTMLR 
RYFVTVWLRH 
IbEbSVTTHT 
LMKFFPSQME 
AILHI IGQLA 
TVKIITSSVS 
SRQVDLPAPQ 
RYLEEHTEFT 
LYRALGFTLA 
DFEERIQESE 
IIHHYVSSCQ 
NLVSPVRAIA 
LSSLEQLMES 
CIHLKLGQFG 
CKGDLQSTDV 



31 
I 

SGTFQQWNL 
PEISTQRKVN 
VALSRNHPSL 
LANEAKIROA 
YNPEVKLGVI 
PVPQMQLKTI 
TNKEAVRVGT 
LCGYQERIKG 
GMTTEFWVRL 
KLLARLLVLM 
WDQKAWEDKL 
TGLEASKVEV 
QSWQISAWRK 
DICLKMAFMK 
MEALSHLSKL 
LLQRQIiDPKG 
TMVGLIAPCT 
EKIFCASSRI 



41 
I 

LDIIDSESAK 
IYNILQDIIQ 
VMYELQHHLK 
ICSAMETFCE 
KSLKPMLGLL 
FTELHVQVCN 
XjNLIRAIVSA 
WGLKYLSVQL 
LCY IMETDYV 
SSPYKGEGRG 
IQFLRNSLKK 
LLLELLYKTD 
DHPWRRETVK 
SWQVTKAIM 
KPFYSTEENS 
LQEMVQLLEK 
CDAHQRTRMA 
AKWCMEFSC 



51 
I 

TDTTGAGLDM 
QEGELEEQCV 
PUJLTDEFVI 
TVQFYLKHLE 
LPNDDLREQV 
KAPAQHQYSS 
DEPRMSIRAI 
TLSTYKLTNR 
EALTPICISI* 
IAMLNLLRTL 
TRGSSWSLRL 
YSNDFDSEGV 
SALMVMYSCV 
NIKDLEDFHF 
BLMDISIHSV 
WILSBKBWER 
SMNVLSSLLD 
DEWSI*IQKL 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



263 



WO 03/003906 



CENTGAMNLQ HDKASVTWIA FFLQMRAKEL EDKVAEILSA I LVHLPWDH PEVRRLLIDG 1140 

ILLLAHHHQE TILTSLLRQP LPMESHLAEV WLAVSENVPF ARTMLHSLMG RLQSRLSPRI 1200 

SATSKADIWR LAAVDPLMTL CTIHLLIQKL DENDKLPDFL PDLIYTLLLQ LGSSHRPEAA 1260 

PPVLKMWKLV HTTPLPEEMN LQRCSRGRVT IKSMQLL.FKR VKSQHLAHTL DEQAVWDLLQ 1320 

DGGTFLEGVS LLARIiCMQHV EGHRQRLAEL VLRGMDSEVL SCRISSTAVC VEMRHRFMSG 13 BO 

PVLYQEKLLK PAALLLEKGA DQSEDEALRV LSLRALGNMA LGAPKKVKQY RKVLLEKCLG 1440 

PLREPVSNSV TAEGJ4EALTK ILAELREGDV GSSFDAMSEQ CRIFFDNESE LLRLKAFILF 1500 

GKIARWGMS KKHFFKGEVK KAWIPLMLHS QDPCSNAAQV RYILGFVSQS GARCSPRKQV 1560 

LGSAGSACMA TMFQCVHFWG WKSLEHPSGP SDTATDDKMT VFQTTMCSIL TRKKPAVLYR 1620 

FLLETMAYVK NNLSRIRIAA CNLAGIIMKQ MSTHYLKKLD FPALRNSLQE LQLDPDPGPT 1680 

GPKGRKLALC LHSEFIHTHA DSVPKARHCA KRAGKPGIiPN CSQSNCQRNS PSGVPSTGES 1740 
CYAVQNPTRA KAFSGTWFK LFCGANGDLI SCPVIKSIAA KQA 



Seq ID NO: 87 DNA sequence 

Nucleic Acid Accession ft: CAT cluster 



1 11 21 31 41 51 

I I I I I I 

CACTTCTCCC TCTCCTTAGT TCTCCGGTAA CCAAACCTTC TCTCGTTATA TACCCAATGA 60 

TAAAACCAAA GTAATGTTGT ACAGGTTAAA GATTCAAAAC AAACCATAGC AACTGAAACA 120 

GGGTATCAGG ACGCAAATGC TTGGCTGGAA TGGATTAAAC ATTCTGTTGC AGGCTAAACA 1B0 

AAAGCGCCTG TTACACTTGT GTGACAGGCA GGCCAGAGAC CCAAATTGTT CCCTTTCCAC 240 

TTGGGTGGTC CTCTCACAGA CCGGGCATGA GCTGTGTGGT AGCTCTCTTC CAGAACCCCA 300 

CAGCCTGGGG TGGTGAGTCA TGCAAGACTC TTTCACTGCT GTTCCCTGAG GTCAAGAGCC 360 

CTGCAGGTCA GCCCTGAGGA CTATCCGGCC TCCAGCCCCT GATGTTAACT TCACCTCATG 420 

CCTTTCAGGG CAGGGGGAAA AGTTAGCATT CCTTGGAGAC TTAACAGGGT GCAGTGAAAC 480 

CAGGCCTTTT GAAGAGCTTA CCAATCAGTC TGCCCTTGTT CATCCCCGAG CAGATGTGTG 540 

GTGGTATTGC TGGGGACTAC TGCTGGGTAC TCTGCCAAGT AATTAGAGCA GCACTCATGC 600 

TCTAGTCCAA CTGGCCATCC CTTTCACCCT AGCATTCTGT CAATATGATA AAAAAAAATG 660 

TAAAGAAGTG CCTCACGTGG GTCCTTGACC CCCATGTTTA TATAAACACT ATTGGGTTCT 720 
ATGAGGAGAA CCCGATGA 

Seq ID NO: 88 DNA sequence 

Nucleic Acid Accession ft: FGENESH predicted 

Coding sequence: 1..759 

1 11 21 31 41 51 

I I I I 

ATGCAGGAAG AAACAGGCTG GCAGCTTGGG GAAAGGATTC T TACA TACCA CGGCGACCAG 60 

GTAACTCTGT GCACAGACCC AGAAGAAAGT AGAAAAATAA GTTTTGGAGG AAACCTCATT 120 

GTGAACACAC CTCACCAGGT CAGAACTATC CTAAGTCAGA AAGCAGAAAG GTGGCTTACT 180 

GACTCCAGAA TCTTAAAGTA TGAGGCTATT CTCTTAGAAA AAGATGATTT AACATTAACC 240 

ACTGATAATT CACTTAACCC AGCAGGTTTC CTAACAGGGG ATCTATATCA AAAGAGAGAG 300 

CACACATGTT TAGTTTTAAT TAATTACCAT ACAAAGGTCC GACCAGACCT AGGAGAAACT 360 

CCCTTCAGGA CGGGACAACA CTTTATAGAT GGTTCCTCCC AGGTGATAGA GGGAAAAAGA 420 

TACAATGGGT ATTCAGTAAT TGATAGAGAA ACTCTTGTAG AAATAGAGTT AGGAAAATTG 480 

CCTAATAATT GGTCTGCTCA AACATGTGAG CTGTTTGCAC TCAGCCAAGC CTTAAAGGAC 540 

TTACAGAACC AGGAAGGAAC CATCTATACA GATTCTAAGT ACGCCTTTGG AGTGGCTCAT 600 

ACATTTGGAA AAATTTGGAC TGAACGAGGT CTTATTAATA GCAAATGCCA AGACTTGGTC 660 

CATAAGGAAT TAATCACCCA AGTATTAACT A ACCTT CAGC TGCCAAAAGA AACAGCTATT 720 
GACCATGTCC CCGOACACCA AAAAAGCCTT TCTTTTTAA 

Seq ID NO: 89 Protein Sequence 
Protein Accession ft: FGENESH predicted 

1 11 i i I 1 r r 

MQEETGWQLG ERILTYHGDQ VTLCTDPEES RKISFGGNLI VNTPHQVRTI bSQKAERWLT 60 
DSRILKYEAI LLEKDDLTLT TDNSLNPAGF LTGDLYQKRE HTCLVLINYH TKVRPDLGET 120 
PFRTGQHFID GSSQVIEGKR YNGYSVIDRB TLVEIELGKL PNNWSAQTCE LFALSQALKD 180 
LQNQEGTIYT DSKYAFGVAH TFGKIWTERG LINSKCQDLV HKELITQVLT NLQLPKETAI 240 
DHVPGHQKSL SP 



Seq ID NO: 90 DNA sequence 

Nucleic Acid Accession ft: NM_000756.1 

Coding sequence t 186.. 776 

I 11 21 31 41 51 

AGAAACTCAG AGACCAAGTC CATTGAGAGA CTGAGGGGAA AGAGAGGAGA GAAAGAAAAA 60 

GAGAGTGGGA ACAGTAAAGA GAAAGGAAGA CAACCTCCAG AGAAAGCCCC CGGAGACGTC 120 

TCTCTGCAGA GAGGCGGCAG CACCCGGCTC ACCTGCGAAG CGCCTGGGAA GCGAGTGCCC 180 

CTAACATGCG GCTGCCGCTG CTTGTGTCCG CGGGAGTCCT GCTGGTGGCT CTCCTGCCCT 240 

GCCCGCCATG CAGGGCGCTC CTGAGCOGCG GGCCGGTCCC GGGAGCTCGG CAGGCGCOGC 300 

AGCACCCTCA GCCCTTGGAT TTCTTCCAGC CGCCGCCGCA GTCCGAGCAG CCCCAGCAGC 360 

CGCAGGCTCG GCCGGTCCTG CTCCGCATGG GAGAGGAGTA CTTCCTCCGC CTGGGGAACC 420 

TCAACAAGAG CCCGGCCGCT CCCCTTTCGC CCGCCTCCTC GCTCCTCGCC GGAGGCAGCG 480 

GCAGCCGCCC TTCGCCGGAA CAGGCGACCG CCAACTTTTT CCGCGTGTTG CTGCAGCAGC 540 

264 



WO 03/003906 



TGCTGCTGCC TCGGCGCTCG CTCGACAGCC CCGCGGCTCT CGCGGAGCGC GGCGCTAGGA 600 

ATGCCCTCGG CGGCCACCAG GAGGCACCGG AGAGAGAAAG GCGGTCCGAG GAGCCTCCCA 660 

TCTCCCTGGA TCTCACCTTC CACCTCCTCC GGGAAGTCTT GGAAATGGCC AGGGCCGAGC 720 

AGTTAGCACA GCAAGCTCAC AGCAACAGGA AACTCATGGA GATTATTGGG AAATAAAACG 780 

GTGCGTTTGG CCAAAAAGAA TCTGCATTTA GCACAAAAAA AATTTAAAAA AATACAGTAT 840 

TCTGTACCAT AGCGCTGCTC TTATGCCATT TGTTTATTTT TATATAGCTT GAAACATAGA 900 

GGGAGAGAGG GAGAGAGCCT ATACCCCTTA CTTAGCATGC ACAAAGTGTA TTCACGTGCA 960 

GCAGCAACAC AATGTTATTC GTTTTGTCTA CGTTTAGTTT CCGTTTCCAG GTGTTTATAG 1020 

TGGTGTTTTA AAGAGAATGT AGACCTGTGA GAAAACGTTT TGTTTGAAAA AGCAGACAGA 1080 

AGTCACTCAA TTGTTTTTGT TGTGGTCTGA GCCAAAGAGA ATGCCATTCT CTTGGGTGGG 1140 

TAAGACTAAA TCTGTAAGCT CTTTGAAACA ACTTTCTCTT GTAAACGTTT CAGTAATAAA 12 00 

ACATCTTTCC AGTCCTTGGT CAGTTTGGTT GTGTAAGAGA ATGTTGAATA CTTATATTTT 1260 
TAATAAAAGT TGCAAAGGT 



Seq ID NO: 91 Protein Sequence 
Protein Accession fj i NP_000747.1 

1 11 21 31 41 51 

1 | I I I I 

MRLPLLVSAG VLLVALLPCP PCRALLSRGP VPGARQAPQH PQPIjDFFQPP PQSEQPQQPQ 60 

ARPVLLRMGE EYFLRLGNLN KSPAAPLSPA SSLIAGGSGS RPSPEQATAN FFRVLbQQLL 120 

LPRRSLDSPA ALAERGARNA LGGHQEAPER ERRSEEPPIS LDLTFHLLRE VLEMARAEQL 180 

AQQAHSNRKI* MEIIGK 



Seq ID NO: 92 DNA sequence 

Nucleic Acid Accession »: NM_004217.l 

Coding sequences 58. -1092 

1 11 21 31 41 51 

GGCCGGGAGA GTAGCAGTGC CTTGGACCCC AGCTCTCCTC CCCCTTTCTC TCTAAGGATG 60 

GCCCAGAAGG AGAACTCCTA CCCCTGGCCC TACGGCCGAC AGACGGCTCC ATCTGGCCTG 120 

AGCACCCTGC CCCAGCGAGT CCTCCGGAAA GAGCCTGTCA CCCCATCTGC ACTTGTCCTC 180 

ATGAGCCGCT CCAATGTCCA GCCCACAGCT GCCCCTGGCC AGAAGGTGAT GGAGAATAGC 240 

AGTGGGACAC CCGACATCTT AACGCGGCAC TTCACAATTG ATGACTTTGA GATTGGGCGT 300 

CCTCTGGGCA AAGGCAAGTT TGGAAACGTG TACTTGGCTC GGGAGAAGAA AAGCCATTTC 360 

ATCGTGGCGC TCAAGGTCCT CTTCAAGTCC CAGATAGAGA AGGAGGGCGT GGAGCATCAG 420 

CTGCGCAGAG AGATCGAAAT CCAGGCCCAC CTGCACCATC CCAACATCCT GCGTCTCTAC 4B0 

AACTATTTTT ATGACCGGAG GAGGATCTAC TTGATTCTAG AGTATGCCCC CCGCGGGGAG 540 

CTCTACAAGG AGCTGCAGAA GAGCTGCACA TTTGACGAGC AGCGAACAGC CACGATCATG 600 

GAGGAGTTGG CAGATGCTCT AATGTACTGC CATGGGAAGA AGGTGATTCA CAGAGACATA 660 

AAGCCAGAAA ATCTGCTCTT AGGGCTCAAG GGAGAGCTGA AGATTGCTGA CTTCGGCTGG 720 

TCTGTGCATG CGCCCTCCCT GAGGAGGAAG ACAATGTGTG GCACCCTGGA CTACCTGCCC 780 

CCAGAGATGA TTGAGGGGCG CATGCACAAT GAGAAGGTGG ATCTGTGGTG CATTGGAGTG 840 

CTTTGCTATG AGCTGCTGGT GGGGAACCCA CCCTTTGAGA GTGCATCACA CAACGAGACC 900 

TATCGCCGCA TCGTCAAGGT GGACCTAAAG TTCCCCGCTT CTGTGCCCAC GGGAGCCCAG 960 

GACCTCATCT CCAAACTGCT CAGGCATAAC CCCTCGGAAC GGCTGCCCCT GGCCCAGGTC 1020 

TCAGCCCACC CTTGGGTCCG GGCCAACTCT CGGAGGGTGC TGCCTCCCTC TGCCCTTCAA 1080 

TCTGTCGCCT GATGGTCCCT GTCATTCACT CGGGTGCGTG TGTTTGTATG TCTGTOTATG 1140 

TATAGGGGAA AGAAGGGATC CCTAACTGTT CCCTTATCTG TTTTCTACCT CCTCCTTTGT 1200 
TTAATAAAGG CTGAAGCTTT TTGT 



Seq ID NO: 93 Protein Sequence 
Protein Accession #: NP_004208 



1 11 21 |1 41 51 

MAOKENSYPW PYGRQTAPSG LSTLPQRVLR KEPVTPSALV LMSRSNVQPT AAPGQKVMEN 
SSGTPDILTR HFTIDDFBIG RPLGKGKFGN VYLAREKXSH FIVALKVLFK SQIEKEGVEH 
QLRREIEIQA HLHHPNILRL YNYFYDRRRI YLILEYAPRG ELYKELQKSC TPDEQRTATI 
MEELADALMY CHGKKVIHRD IKPENLLLGL KGELKIADFG WSVHAPSLRR KTMCGTLDYL 240 
PPEMIEGRMH NEKVDLWCIG VLCYELLVGN PPFESASHNE TYRRIVKVDL KPPASVPTQA 300 
QDLISKLLRH NPSERLPLAQ VSAHPWVRAN SRRVLPPSAL QSVA 



60 

120 

180 



Seq ID NO: 94 DNA sequence 

Nucleic Acid Accession «: NM_007019.1 

Coding sequence: 41.. 580 



l 11 21 31 41 51 

GGCACGAGCG AGTTCCTGTC TCTCTGCCAA CGCCGCCCGG ATGGCTTCCC AAAACCGCGA 60 

CCCAGCCGCC ACTAGCGTCG CCGCCGCCCG TAAAGGAGCT GAGCCGAGCG GGGGCGCCGC 120 

CCGGGGTCCG GTGGGCAAAA GGCTACAGCA GGAGCTGATG ACCCTCATGA TGTCTGGCGA 180 



CCGGGGTCCG GTGGGCAAAA WLlAU^Oi wn^»wi« 

TAAAGGGATT TCTGCCTTCC CTGAATCAGA CAACCTTTTC AAATGGGTAG GGACCATCCA 240 

TCGAGCAGCT GGAACAGTAT ATGAAGACCT GAGGTATAAG CTCTCGCTAG AGTTCCCCAG 300 

TGGCTACCCT TACAATGCGC CCACAGTGAA GTTCCTCACG CCCTGCTATC ACCCCAACGT 360 

GGACACCCAG GGTAACATAT GCCTGGACAT CCTGAAGGAA AAGTGGTCTG CCCTGTATGA 420 

TCTCAGGACC ATTCTGCTCT CCATCCAGAG CCTTCTAGGA GAACCCAACA TTGATAGTCC 480 



265 
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PCT/US02/21338 



10 

15 

20 

25 

30 

35 

40 

45 

50 

55 

60 

65 

70 

75 



CTTGAACACA 
AGAAACCTAC 
CTTGTGTCGT 
GACTCTTTAT 
GTTGAGCCCT 
AAA 



CATGCTGCCG AGCTCTGGAA AAACCCCACA GCTTTTAAGA AGTACCTGCA 
TCAAAGCAGG TCACCAGCCA GGAGCCCTGA CCCAGGCTGC CCAGCCTGTC 
CTTTTTAATT TTTCCTTAGA TGGTCTGTCC TTTTTGTGAT TTCTGTATAG 
CTTGAGCTGT GGTATTTTTG TTTTGTTTTT GTCTTTTAAA TTAAGCCTCG 
TGTATATTAA ATAAATGCAT TTTTGTCCTT TTTTAAAAAA AAAAAAAAAA 



540 
600 
660 
720 
780 



Seq ID NO: 95 Protein Sequence 
Protein Accession #: NP_008950.i 

l 11 



31 



41 



51 



21 

| 1 | I 1 1 

MASQNRDPAA TSVAAARKGA EPSGGAARGP VGKRLQQELM TLMMSGDKGI SAFPESDNLF 60 
KWVGTIHGAA GTVYEDLRYK LSLEFPSGYP YNAPTVKFIiT PCYHPNVDTQ GNICLDILKE 120 
KWSALYDVRT ILLSIQSLLG EPNIDSPLNT HAAELWKNPT AFKKYLQETY SKQVTSQEP 179 

Seq ID NO: 96 DNA sequence 
Nucleic Acid Accession ft: AK055663 
Coding sequence: 38.. 1423 



11 



AGAACGGCTT 
AAAACCACAA 
CCGAAGGTCC 
GCTTATGTGG 
TTTTGATCTT 
TAGCCCTGTC 
AGTCTTGGCA 
ACAGCCCGAG 
CCTGTTCACG 
TACGAGCTGG 
GGGACTTAGC 
AGCATTTGCT 
CACTGCCTCT 
GTACAGTGGG 
ACTCATCAGA 
GACCCTAGGT 
TGAACAAATG 
TGTTCAAATT 
CAATGTCCTA 
TGATTTGAAC 
ATTTAACACT 
TTATGGTTTT 
TGGAGTTCCA 
TAGATATGGA 
TTATAAGGAA 
TGTTTAATCA 
TATGAAACTA 
GCTTTAAATA 
GTTTTGTAGT 
AGATCTGTCA 
TGCCACTGTG 
CTTAGTTTTT 
ATGCAGTGGC 
TGCCTCAGCC 
TGTATTTTTA 
ACCTCATGAT 
CCTGGCCGAT 
GGGAAAGGGA 
AATTGCTAAA 
TTTTTAGCAG 
GATTTTTGTT 



CCGGCGGGAG 
AGATCCTTTT 
TGGAAGATAC 
TGCAGTTCTA 
TTTAGTTTAA 
TATTCATTTG 
CAGTTGGGAG 
ATACACACGG 
ATGCTTTCTA 
CTTCAAGAGC 
AGTATCTTCC 
CTTTGTATTA 
GCTATAGCTA 
AAAGTCTTAC 
GAGGTATCTA 
TTTGGCTCAT 
GTTCTTGCTC 
TTCAAGGATG 
AACTTTTCAG 
CCAGTTACAT 
CCTGGGAAAA 
GGTCTCAATC 
GGAATTGGAG 
ACTAATAATA 
TATTGACTCC 
TTTACTCTAA 
TATTTTTGTA 
GGCTTCCTTT 
TGACTGCAGT 
CATTACTAAG 
CCCGGCCAAT 
GTTTTGTTTT 
ATGATCTCAG 
TCCCGAGTAG 
GTAAAGACGG 
CCACCCACCT 
ATTTTCTTTA 
AAAATGTCTG 
TTTTTCTTTG 
AAATTTTGGA 
AAAGTTTCTC 



21 
1 

CTGTGCAGCT 
TTGGCAAGTT 
TGCTCTTTGG 
CTAATAGTAT 
TGACATGTTT 
GGTTTGAAAG 
CTCTCTTTAT 
GAAGATTATT 
TTCGGAATAA 
ATGTTGCAGA 
TTCCCCGAAT 
CATATATGCT 
TTGCCTTGAT 
TCCAGACAAC 
CCTTAGATGG 
TGGCTGGATC 
ATGTGACCAA 
ACTGGATTAG 
ATCATCACGT 
CAACTCCAGC 
ATGTGAACCC 
ATGGACACAC 
CAACTCAAGG 
GAATTGGACA 
TTGGCTTCCA 
ATGTTAGATA 
AAATGTATTT 
AGAAAATGTG 
GTGATGTGAC 
ATACGATATT 
ACATTATTAT 
GTTTTTTGAG 
CTCACTGCAA 
CTGGGATTAC 
GGGATTTCAC 
TAGCCTCCCA 
ATGAAATTTA 
TTCAAAAAGT 
AGGTTCTCCT 
ATACATTCTA 
TCCTTTAAAA 



31 



41 



51 



CCTTATCATG 
GTTACGGGAA 
TGTAATAAAC 
AGCTTTAACT 
AATAAGTTAC 
ATTAGAAGTC 
ATTAAAAGAA 
AGTTGGTACT 
ACCTTTTGCT 
TCTTAGTCGA 
GAATCCATTT 
CATTGAAATT 
GACATTTGGC 
ACCACCCCAT 
AGTTTTAGAA 
AGTGCATGTA 
CAGGCTGTAC 
GCCTGCCTTA 
AATCCCAATG 
TAAACCTAGT 
AGTTATTCTT 
ACCTTACAGC 
ATTGAGGACT 
ACCAAGACCA 
ATTTATTTAG 
ATAGTAGTCT 
GTGACAGTGA 
TTTCTTTAAA 
CTTACCTTTA 
TCTTTTTTTT 
TAACTTAAGG 
ATGGAGTCTC 
CCTCTGCCTC 
AGGCACCTGC 
CATGTTGGCC 
AAGTGCTGGG 
TAAATATGCT 
AAAGGTCTCT 
GAATTATGTC 
TCTAGCACAA 
ATTTTAGTAC 



GGGACAATTC 
TTTAGACTTG 
TTGATATGTA 
GCCTATACTT 
TGGGTAACAT 
CTGGCTGTAT 
AGTGCAGAAC 
TTTGTGGCTC 
TATGTCTCAG 
AGCTTGTGTG 
GTTTTGATTG 
AATAATTATT 
ACTATGTATC 
GTTATTGGTC 
GTCCGAAATG 
AGAATTCGAC 
ACTCTAGTGT 
TTGTCTGGGC 
CCTCTTTTAA 
AGTCCACCTC 
CTAAACACAC 
AGCATGCTTA 
GGTTTTACAA 
TGATAGACTC 
TAATCCAACT 
TGTTCACATT 
AATCCTCGTA 
TTTGGATTTT 
TAAGAGCCAC 
TCCGAGACGG 
CTGTACTTTA 
ACTCTGTCGC 
CTGAGTTCAA 
CACCACGCCC 
AGGCTGGTCT 
ATTAGGTGTG 
TCTTGAATAA 
TTTATAGCTT 
TTACAAACTA 
TTTGAATTTT 
ATTTGTAAAT 



ATCTCTTTCG 
TAGCAGCTGA 
CTGGCTTCCT 
ACCTGACCAT 
TGAGGAAACC 
TTGCCTCCAC 
GCTTTTTGGA 
TTTGTTTCAA 
AAGCTGCTAG 
GAATTATTCC 
ATCTTGCTGG 
TTGCCGTAGA 
CCATGAGTGT 
AGTTGGACAA 
AACATTTTTG 
GAGATGCCAA 
CTACTCTAAC 
CTGTTGCAGC 
AGGGTACTGA 
CAGAATTTTC 
AAACAAGGCC 
ATCAAGGACT 
ATATACCAAG 
TAACTTATTT 
TTGCATTGAC 
TCATGAAACC 
AATGTTAAAG 
GGTATCTTTG 
TTGATGGAGT 
AGTCTTGCTC 
TTAAGGCTTC 
CCAGGCTGGA 
ATGATTCTCC 
AGCTAATTTT 
TGAACTCCTG 
AGCCACCGCA 
TACACATTTT 
TTCCAAACTT 
AAAGCAAAAA 
TAATTATCAA 



Seq ID NO: 97 Protein Sequence 
Protein Accession #: BAB709B0.1 



MGTIHLPRKP 
TAYTYLTIFD 
ESAERPLEQP 
RSLCGIIPGI. 
GTMYPMSVYS 
VRIRRDANEQ 
MPLLKGTDDIi 
SSMLNQGLGV 



11 
I 

QRSFFGKLLR 
LFSLMTCLIS 
EIHTGRLLVG 
SSIFLPRMNP 
GKVLIjQTTPP 
MVLAHVTNRL 
NPVTSTPAKP 
PGIGATQGLR 



21 
I 

EFRLVAADRR 
YWVTLRKPSP 
TFVALCFNLF 
FVLIDLAGAF 
HVIGQLDKLI 
YTLVSTLTVQ 
SSPPPETSFN 
TGFTNIPSRY 



31 

I 

SWKILLFGVI 
VYSFGFERLE 
TMLSIRNKPF 
ALCITYKLIE 
REVSTLDGVL 
IFKDDWIRPA 
TPGKNVNPVI 
GTNNRIGQPR 



41 

I 

NtilCTGFIiLM 
VLAVFASTVL 
AYVSEAASTS 
INNYFAVDTA 
EVRNEHFWTL 
LLSGPVAANV 
LLNTQTRPYG 
P 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 



51 
I 

WCSSTNSIAL 
AQLGALPILK 
WLQEHVADLS 
SAIAIALMTF 
GFGSLAGSVH 
LNFSDHHVIP 
FGLNHGHTPY 



60 

120 

180 

240 

300 

360 

420 



266 
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Seq ID NO: 98 DNA sequence 

Nucleic Acid Accession fl: CAT cluster 

1 11 21 31 41 SI 

11)111 
ATGGACTTAA CCAATGCTTC ACAAATACCA AGAAAAAAAT ATTTCAATGT GATAAATATA 60 
GGAAAGTTTT TCATAAATTT TCAAATTCAA ACTGACATAA GAGAAGACAT ACGGGAAAAA 120 
AGACCTTTCA AATGTATAGA ATGTGGCAAA GCTTTTAACA AGTTTTCAAC CCTTACTACA 180 
CATAAGAAAA TTCATACTAG AGAGAAACCC TACAAATGTG AAGAATGTGG CAAAGCCTTC 240 
AACTGGTCCT CATACCTTAC TATACATAAG AGAATTCATA CTGGAGAGAA ACTCTACAAA 300 
TGTGAAGAAT GTGGCAAAGC CTTCAACTGG TCCTCATACC TTACTGCATA TAAGATAACT 360 
CATACTAGAG AGAAACCCTA CAAATGTGAA GAATGTGGCA AAGCCTTTAA GTACTCCTCG 420 
AACCTCACTA CACATAAGAT AATTCATACT GGAGAGCATC TCTACAAGTG TGAAGAATGT 480 
GGCAAAGCGT TTAACCATGC TGCATGCTTC TTTGTCATTT TGAAGATTTG A 

Seq ID NO: 99 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: 1..531 

1 11 21 31 41 51 

ATGGACTTAA CCAATGCTTC ACAAATACCA AGAAAAAAAT ATTTCAATGT GATAAATATA 60 

GGAAAGTTTT TCATAAATTT TCAAATTCAA ACTGACATAA GAGAAGACAT ACGGGAAAAA 120 

AGACCTTTCA AATGTATAGA ATGTGGCAAA GCTTTTAACA AGTTTTCAAC CCTTACTACA 180 

CATAAGAAAA TTCATACTAG AGAGAAACCC TACAAATGTG AAGAATGTGG CAAAGCCTTC 240 

AACTGGTCCT CATACCTTAC TATACATAAG AGAATTCATA CTGGAGAGAA ACTCTACAAA 300 

TGTGAAGAAT GTGGCAAAGC CTTCAACTGG TCCTCATACC TTAC TGCA TA TAAGATAACT 360 

CATACTAGAG AGAAACCCTA CAAATGTGAA GAATGTGGCA AAGCCTTTAA GTACTCCTCG 420 

AACCTCACTA CACATAAGAT AATTCATACT GGAGAGCATC TCTACAAGTG TGAAGAATGT 480 
GGCAAAGCGT TTAACCATGC TGCATGCTTC TTTGTCATTT TGAAGATTTG A 

Seq ID NO: 100 Protein Sequence 
Protein Accession 8: FGENESH predicted 

1 11 21 31 41 51 

llDLTNASQIP RKKYFNVINI GKFFINFQIQ TDIREDIREK RPFKCIECGK AFNKFSTLTT 60 
HKKIHTREKP YKCEECGKAF NWSSYI/TIHK RIHTGEKLYK CEECGKAFNW SSYLTAYKIT 120 
HTREKPYKCE ECGKAFKYSS NLTTHKIIHT GEHLYKCEEC GKAFNHAACF FVILKI 

Seq ID NO: 101 DNA sequence 

Nucleic Acid Accession 8: NM_032589.1 

Coding sequence: 147-422 



60 
120 



1 11 21 31 41 SI 

GGCACGAGGT CTTGTTATGC AAATAGGCTT CCCACTTGGC AGGGGCCGTC TTGTCCACTC 
GTTTCTGTAA ACATGGGTGG CAAAAAGAGA AGATGGAGCT GCCATTTAGA ACATGOCTAA 

TCCCAGCTTC ATCTTGCTGA GCAAAAATGA AGGAGCCTGG ACCCAACTTT GTTACTGTGA 180 

GAAAGGGTCT TCATTCATTC AAGATGGCAT TTGTTAAGCA CCTACTGCTG GAGTGCAGTG 240 

GTTCAATCAC GGATCACTGC AGCCTCCACC TCCCAGTTCA AGAAATTCTC ATGTCTCAGC 300 

CTCCTGAGCA GCTAGGATTA CAGACAAACC TTGGAAATCA AGAAAGTTCT GGAATGATGA 360 

AGCTGTTCAT GCCAAGACCG AAAGTGCTGG CCCAGTATGA GTCCATTCAG TTCATGCCGT 420 

GACAATTTTC TTGGAACTCC TTTTTATTGT TAGTTCTCAC TTGTTTCCAT ATTTAGTGAA 480 
TGTACATTTA ATTGCAAAGC TGTCATTAAT AAAAATTCTT ATAGTACCTC ACTGCAC 



Seq ID NO: 102 Protein sequence: 
Protein Accession #: NP_115978.1 

1 11 21 31 41 51 

MKEPGPNFVT VRKGLHSFKM AFVKHLLLEC SGSITDHCSL HLPVQEILMS QPPEQLGLQT 
NLGNQESSGM MKLFMPRPKV LAQYESIQFM P 

Seq ID NO: 103 DNA sequence 
Nucleic Acid Accession ft: NM_01450l.l 
Coding sequence: 60 -.737 

1 11 21 31 41 51 

GGCGGACCGA AGAACGCAGG AAGGGGGCCG GGGGGACCCG CCCCCGGCCG GCCGCAGCCA 

TGAACTCCAA CGTGGAGAAC CTACCCCCGC ACATCATCCG CCTGGTGTAC AAGGAGGTGA 120 

CGACACTGAC CGCAGACCCA CCCGATGGCA TCAAGGTCTT TCCCAACGAG GAGGACCTCA 180 

CCGACCTCCA GGTCACCATC GAGGGCCCTG AGGGGACCCC ATATGCTGGA GGTCTGTTCC 240 

GCATGAAACT CCTGCTGGGG AAGGACTTCC CTGCCTCCCC ACCCAAGGGC TACTTCCTGA 300 

CCAAGATCTT CCACCCGAAC GTGGGCGCCA ATGGCGAGAT CTGCGTCAAC GTGCTCAAGA 360 

GGGACTGGAC GGCTGAGCTG GGCATCCGAC ACGTACTGCT GACCATCAAG TGCCTGCTGA 420 

267 



60 



WO 03/003906 



TCCACCCTAA CCCCGAGTCT GCACTCAACG AGGAGGCGGG CCGCCTGCTC TTGGAGAACT 480 

AOGAGGAGTA TGCGGCTCGG GCCCGTCTGC TCACAGAGAT CCACGGGGGC GCCGGCGGGC 540 

CCAGCGGCAG GGCCGAAGCC GGTCGGGCCC TGGCCAGTGG CACTGAAGCT TCCTCCACCG 600 

ACCCTGGGGC CCCAGGGGGC CCGGGAGGGG CTGAGGGTCC CATGGCCAAG AAGCATGCTG 660 

GCGAGCGCGA TAAGAAGCTG GCGGCCAAGA AAAAGACGGA CAAGAAGCGG GCGCTGCGGG 720 

CGCTGCGGCG GCTGTAGTGG GCTCTCTTCC TCCTTCCACC GTGACCCCAA CCTCTCCTGT 780 

CCCCTCCCTC CAACTCTGTC TCTAAGTTAT TTAAATTATG GCTGGGGTCG GGGAGGGTAC 840 
AGGGGGCACT GGGACCTGGA TTTGTTTTTC TAAATAAAGT TGGAAAAGCA 

Seq ID NO: 104 Protein Sequence 
Protein Accession Jh NP_05S316.l 

1 11 21 31 41 51 

| | | I I I 

MNSNVENLPP HI IRLVYKEV TTLTADPPDG IKVFPNEEDL TDLQVTIEGP EGTPYAGGLF 60 

RMKLLLGKDF PASPPKGYFL TKIFHPNVGA NGEICVNVLK RDWTAELGIR HVLLTIKCLL 120 

IHPNPESALN EEAGRLLLEN YEEYAARARL LTEIHGGAGG PSGRAEAGRA liASGTEASST 160 

DPGAPGGPGG AEGPMAKKHA GERDKKLAAK KKTDKKRALR ALRRL 



Seq ID NO: 105 DNA sequence 
Nucleic Acid Accession ft: NMJ305101 
Coding sequence: 76.-573 

1 11 21 31 41 51 

CGGCTGAGAQ GCAGCGAACT CATCTTTGCC AGTACAGGAG CTTGTGCCGT GGCCCACAGC 60 

CCACAGCCCA CAGCCATGGG CTGGGACCTG ACGGTGAAGA TGCTGGCGGG CAACGAATTC 120 

CAGGTGTCCC TGAGCAGCTC CATGTCGGTG TCAGAGCTGA AGGCGCAGAT CACCCAGAAG 180 

ATTGGCGTGC ACGCCTTCCA GCAGCGTCTG GCTGTCCACC CGAGCGGTGT GGCGCTGCAG 240 

GACAGGGTCC CCCTTGCCAG CCAGGGCCTG GGCCCTGGCA GCACGGTCCT GCTGGTGGTG 300 

GACAAATGCG ACGAACCTCT GAGCATCCTG GTGAGGAATA ACAAGGGCCG CAGCAGCACC 360 

TACGAGGTCC GGCTGACGCA GACCGTGGCC CACCTGAAGC AGCAAGTGAG CGGGCTGGAG 420 

GGTGTGCAGG ACGACCTGTT CTGGCTGACC TTCGAGGGGA AGCCCCTGGA GGACCAGCTC 480 

CCGCTGGGGG AGTACGGCCT CAAGCCCCTG AGCACCGTGT TCATGAATCT GCGCCTGCGG 540 

GGAGGCGGCA CAGAGCCTGG CGGGCGGAGC TAAGGGCCTC CACCAGCATC CGAGCAGGAT 600 
CAAGGGCCGG AAATAAAGGC TGTTGTAAGA GAAT 

Seq ID NO: 106 Protein Sequence 
Protein Accession 8: NP_005092 

1 U 21 31 41 51 

MGWDLTVKML AGNEFQVSLS SSMSVSELKA QITQKIGVHA FQQRLAVHPS GVALQDRVPL 
ASQGLGPGST VLLWDKCDE PLSILVRNNK GRSSTYEVRL TQTVAHLKQQ VSGLEGVQDD 
LFWLTFEGKP LEDQLPLGEY GLKPLSTVFM NLRLRGGGTE PGGRS 

Seq ID NO: 107 DNA sequence 
Nucleic Acid Accession ft: NM_006018 
Coding sequence: 1..1161 

1 11 21 31 41 ^ 

ItGAATCGGC icCATCTGCA GGATCACTTT CTGGAAATAG ACAAGAAGAA CTGCTGTGTG 
TTCCGAGATG ACTTCATTGC CAAGGTGTTG CCGCCGGTGT TGGGGCTGGA GTTTi^CTTT 
GGGCTTCTGG GCAATGGCCT TGCCCTGTGG ATTTTCTGTT TCCACCTCAA GTCCTGGAAA 
TCCAGCCGGA TTTTCCTGTT CAACCTGGCA GTAGCTGACT TTCTACTGAT CATCTGC CT G 
CCGTTCGTGA TGGACTACTA TGTGCGGCGT TCAGACTGGA AGTTTGGGGA CATCCCTTGC 



60 
120 



60 
120 
180 
240 
300 

CCGTTCGTGA TWJAtriALTiM iuiw-ww-w* * w*w™--.~~-- — — J ~_ , ., „».-, , £n 

SgCTGGTGC TCTTCATGTT TGCCATGAAC CGCCAGGGCA GCATCATCTT CCTCACGGTG 360 
GTGGCGGTAG ACAGGTATTT CCGGGTGGTC CATCCCCACC ACGCCCTGAA CAAGATCTCC 420 
AaSSg CAGCCATCAT CTCTTGCCTT CTGTGGGGCA TCACTGTTGG CCTAACAGTC 480 
CACCTCCTGA AGAAGAAGTT GCTGATCCAG AATGGCCCTG CAAATGTGTG CATCAGCTTC 540 
AGCATCTGCC ATACCTTCCG GTGGCACGAA GCTATGTTCC TCCTGGAGTT CCTCCTGCCC 600 
CTCAGCCAfiA AHAlCm = GAGACJA^ 6 0 
GACCGGCATG CCAAGATCAA GAGAGCCATC ACCTTCATCA TGGTGGTGGC CATCGTCTTT 720 
SSSSSS TCCTTCCCAG CGTGGTTGTG CGGATCCGCA TCTTCTGGCT CCTGCACACT 780 
« AGAATTGTGA AGTGTACCGC TCGGTGGACC TGGCGTTCTT TATCACTCTC 840 
Ig^ScCT ACATGAACAG CATGCTGGAC CCCGTGGTGT ACTACTTCTC CAGCCCATCC 900 
^J^CT TCTTCTCCAC TTTGATCAAC CGCTGCCTCC AGAGGAAGAT GACAGGTGAG 960 
SISt^S Ic^SgSc gIgcgtcgag CTCACAGGGG accccaacaa AACCAGAGGC 1020 
CGTTAATGGC CAACTCCGGT GAGCCATGGA GCCCCTCTTA ^GGCCCA 1080 
^TCAAATA ACCATTCCAA GAAGGGACAT TGTCACCAAG AACCAGCATC TCTGGAGAAA 1140 
CAGTTGGGAT GTTGCATCGA G 

Seq ID NO: 108 Protein Sequence 
Protein Accession ft: NP_006009.1 

x ii 21 31 41 51 

268 



WO 03/003906 



PCT/US02/21338 



MNRHHLQDHF LEIDKKNCCV FRDDFIAKVL PPVLGLEFIF GLLGNGLALW IFCFHLKSWK 60 

SSRIFLFNLA VADFLLI ICL PFVMDYYVRR SDWKFGDIPC RLVLFMFAMN RQGSIIFLTV 120 

VAVDRYFRW HPHHALNKIS NWTAAIISCL LWGITVGLTV HLLKKKLLIQ NGPANVCISF 180 

5 SICHTFRWHE AMFLLEFLLP LGIILFCSAR IIWSLRQRQM DRHAKIKRAI TFIMWAIVF 240 

VICFLPSWV RIRIFWLLHT SGTQNCEVYR SVDIiAFFITL SFTYMNSMLD PWYYFSSPS 300 

FPNFFSTLIN RCLQRKMTGE PDNNRSTSVE LTGDPNKTRG APEALMANSG EPWSPSYLGP 360 
TSNNHSKKGH CHQEPASLEK QLGCCIE 

10 Seq ID NO: 109 DNA sequence 

Nucleic Acid Accession #: NMJJ05030.2 
Coding sequence: 63.. 1874 

1 11 21 31 41 51 

15 | III I I 

GGCACGAGGG AGCGGTGCGG AGGCTCTGCT CGGATCGAGG TCTGCAGCGC AGCTTCGGGA 60 

GCATGAGTGC TGCAGTGACT GCAGGGAAGC TGGCACGGGC ACCGGCCGAC CCTGGGAAAG 120 

CCGGGGTCCC CGGAGTTGCA GCTCCCGGAG CTCCGGCGGC GGCTCCACCG GC GAAAG AGA 180 

TCCCGGAGGT CCTAGTGGAC CCACGCAGCC GGCGGCGCTA TGTGCGGGGC CGCTTTTTGG 240 

20 GCAAGGGCGG CTTTGCCAAG TGCTTCGAGA TCTCGGACGC GGACACCAAG GAGGTGTTCG 300 

CGGGCAAGAT TGTGCCTAAG TCTCTGCTGC TCAAGCCGCA CCAGAGGGAG AAGATGTCCA 360 

TGGAAATATC CATTCACCGC AGCCTCGCCC ACCAGCACGT CGTAGGATTC CACGGCTTTT 420 

TCGAGGACAA CGACTTCGTG TTCGTGGTGT TGGAGCTCTG CCGCCGGAGG TCTCTCCTGG 480 

AGCTGCACAA GAGGAGGAAA GCCCTGACTG AGCCTGAGGC CCGATACTAC CTACGGCAAA 540 

25 TTGTGCTTGG CTGCCAGTAC CTGCACCGAA ACCGAGTTAT TCATCGAGAC CTCAAGCTGG 600 

GCAACCTTTT CCTGAATGAA GATCTGGAGG TGAAAATAGG GGATTTTGGA CTGGCAACCA 660 

AAGTCGAATA TGACGGGGAG AGGAAGAAGA CCCTGTGTGG GACTCCTAAT TACATAGCTC 720 

CCGAGGTGCT GAGCAAGAAA GGGCACAGTT TCGAGGTGGA TGTGTGGTCC ATTGGGTGTA 780 

TCATGTATAC CTTGTTAGTG GGCAAACCAC CTTTTGAGAC TTCTTGCCTA AAAGAGACCT 840 

30 ACCTCCGGAT CAAGAAGAAT GAATACAGTA TTCCCAAGCA CATCAACCCC GTGGCCGCCT 900 

CCCTCATCCA GAAGATGCTT CAGACAGATC CCACTGCCCG CCCAACCATT AACGAGCTGC 960 

TTAATGACGA GTTCTTTACT TCTGGCTATA TCCCTGCCCG TCTCCCCATC ACCTGCCTGA 1020 

CCATTCCACC AAGGTTTTCG ATTGCTCCCA GCAGCCTGGA CCCCAGCAAC CGGAAGCCCC 1080 

TCACAGTCCT CAATAAAGGC TTGGAGAACC CCCTGCCTGA GCGTCCCCGG GAAAAAGAAG 1140 

35 AACCAGTGGT TCGAGAGACA GGTGAGGTGG TCGACTGCCA CCTCAGTGAC ATGCTGCAGC 1200 

AGCTGCACAG TGTCAATGCC TCCAAGCCCT CGGAGCGTGG GCTGGTCAGG CAAGAGGAGG 1260 

CTGAGGATCC TGCCTGCATC CCCATCTTCT GGGTCAGCAA GTGGGTGGAC TATTCGGACA 1320 

AGTACGGCCT TGGGTATCAG CTCTGTGATA ACAGCGTGGG GGTGCTCTTC AATGACTCAA 1380 

CACGCCTCAT CCTCTACAAT GATGGTGACA GCCTGCAGTA CATAGAGCGT GACGGCACTG — 
40 AGTCCTACCT CACCGTGAGT TCCCATCCCA ACTCCTTGAT GAAGAAGATC ACCCTCCTTA 
AATATTTCCG CAATTACATG AGCGAGCACT TGCTGAAGGC AGGTGCCAAC ATCACGCCGC 

GCGAAGGTGA TGAGCTCGCC CGGCTGCCCT ACCTACGGAC CTGGTTCCGC ACCCGCAGOG 1620 

CCATCATCCT GCACCTCAGC AACGGCAGCG TGCAGATCAA CTTCTTCCAG GATCACjKCA 1680 

AGCTCATCTT GTGCCCACTG ATGGCAGCCG TGACCTACAT CGACGAGAAG CGGGACTTCC • 
45 GCACATACCG CCTGAGTCTC CTGGAGGAGT ACGGCTGCTG CAAGGAGCTG GCC^CCGGC 
TCCGCTACGC CCGCACTATG GTGGACAAGC TGCTGAGCTC ACGCTCGGCC AGCAACCGTC 
TCAAGGCCTC CTAATAGCTG CCCTCCCCTC CGGACTGGTG CCCTCCTCAC TCCCACCTGC 
ATCTGGGGCC CATACTGGTT GGCTCCCGCG GTGCCATGTC TGCAGTGTGC CCCGGAGCCC 
CGGTGGCTGG GCAGAGCTGC ATCATCCTTG CAGGTGGGGG TTGCTGTGTA AGTTATTTTT 

50 gS^ GGGTGTGGGT TCTACAGCCT TGTCCCCCTC CCCCTCAACC ^CATATG 2100 

AATTGTACAG AATATTTCTA TTGAATTCGG AACTGTCCTT TCCTTGGCTT TATGCACATT 2160 
AAACAGATGT GAATATTCAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA 



1440 
1500 
1560 



1740 
1800 
1860 
1920 
1980 
2040 



55 Seq ID NO: 110 Protein Sequence 
Protein Accession ti NPJ>05021.2 

1 11 f f i 1 i 1 

60 MSRAVTAGKL ARAPADPGKA GVPGVAAPGA PAAAPPAKEI PEVLVDPRSR RRYVRGRFLG 
KGGFAKCFEI ^S^EVFA GKIVPKSLLL KPHQREKMSM BISIHRSLAH OH™"*)" 
EDNDFVFWL ELCRRRSLLE LHKRRKALTE PEARYYLRQI VLGCQYLHRM RVIHRDLKLG 

kigdfSatk veydgerkkt lcgtpnyiap evlskkghsp 

MCTlIvGKPP FETSCLKETY LRIKKNEVSI PKHINPVAAS LIQKMLQTDP TARPTINELb 300 

65 parlpTtclt IPPRFSIAPS SLDPSNRXPL TVLHKGLENP LPERPREKEE 360 

DCHLSDMLQQ LHSVHASKPS ERGLVRQEEA EDPACIPIFW VSKWVDYSDK 420 

yS£™ SVGVLPNDS? RLIWNDGDS LQYIERDGTB SYLTVSSHPN SLMKKITLLK 480 

YFRNYMSEHL LKAGANITPR EGDELARLPY LRTWFRTRSA IILHLSHGSV QINPFQDHTK 540 
IiSpS ^DEKRDFR TYRLSLLEEY GCCKELASRL RYARTMVDKL LSSRSASNRL 



70 



75 



XAS 



Seq ID NO: 111 DNA sequence 

Nucleic Acid Accession 8: NMJJ05409.3 

Coding sequence: 94.. 378 

1 11 f I 1 i 1 I' 

TTCCTTTCAT GTTCAGCATT TCTACTCCTT CCAAGAAGAG CAGCAAAGCT GAAGTAGCAG 

269 
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600 
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CAACAGCACC AGCAGCAACA GCAAAAAACA AACATQAGTG TGAAGGGCAT GGCTATAGCC 120 

TTGGCTGTGA TATTGTGTGC TACAGTTGTT CAAGGCTTCC CCATGTTCAA AAGAGGACGC 1BO 

TGTCTTTGCA TAGGCCCTGG GGTAAAAGCA GTGAAAGTGG CAGATATTGA GAAAGCCTCC 240 

ATAATGTACC CAAGTAACAA CTGTGACAAA ATAGAAGTGA TTATTACCCT GAAAGAAAAT 300 

AAAGGACAAC GATGCCTAAA TCCCAAATCG AAGCAAGCAA GGCTTATAAT CAAAAAAGTT 360 

GAAAGAAAGA ATTTTTAAAA ATATCAAAAC ATATGAAGTC CTGGAAAAGG GCATCTGAAA 420 

AACCTAGAAC AAGTTTAACT GTGACTACTG AAATGACAAG AATTCTACAG TAGGAAACTG 480 

AGACTTTTCT ATGGTTTTGT GACTTTCAAC TTTTGTACAG TTATGTGAAG GATGAAAGGT 540 

GGGTGAAAGG ACCAAAAACA GAAATACAGT CTTCCTGAAT GAATGACAAT CAGAATTCCA 600 

CTGCCCAAAG GAGTCCAGCA ATTAAATGGA TTTCTAGGAA AAGCTACCTT AAGAAAGGCT 660 

GGTTACCATC GGAGTTTACA AAGTGCTTTC ACGTTCTTAC TTGTTGTATT ATACATTCAT 720 

GCATTTCTAG GCTAGAGAAC CTTCTAGATT TGATGCTTAC AACTATTCTG TTGTGACTAT 780 

GAGAACATTT CTGTCTCTAG AAGTTATCTG TCTGTATTGA TCTTTATGCT ATATTACTAT 840 

CTGTGGTTAC AGTGGAGACA TTGACATTAT TACTGGAGTC AAGCCCTTAT AAGTCAAAAG 900 

CATCTATGTG TCGTAAAG CA TTCCTCAAAC ATTTTTTCAT GCAAATACAC ACTTCTTTCC 960 

CCAAATATCA TGTAGCACAT CAATATGTAG GGAAACATTC TTATGCATCA TTTGGTTTGT 1020 

TTTATAACCA ATTCATTAAA TGTAATTCAT AAAATGTACT ATGAAAAAAA TTATACGCTA 1080 

TGGGATACTG GCAACAGTGC ACATATTTCA TAACCAAATT AGCAGCACCG GTCTTAATTT 1140 

GATGTTTTTC AACTTTTATT CATTGAGATG TTTTGAAGCA ATTAGGATAT GTGTGTTTAC 1200 

TGTACTTTTT GTTTTGATCC GTTTGTATAA ATGATAGCAA TATCTTGGAC ACATTTGAAA 1260 

TACAAAATGT TTTTGTCTAC CAAAGAAAAA TGTTGAAAAA TAAGCAAATG TATACCTAGC 1320 

AATCACTTTT ACTTTTTGTA ATTCTGTCTC TTAGAAAAAT ACATAATCTA ATCAATTTCT 1380 

TTGTTCATGC CTATATACTG TAAAATTTAG GTATA CTCAA GACTAGTTTA AAGAATCAAA 1440 
GTCATTTTTT TCTCTAATAA ACTACCACAA CCTTTCTTTT TTAAAAAAAA AAA 

Seq ID NO: 112 Protein Sequence 
Protein Accession #: NP_005400.1 

1 11 21 31 41 51 

MSVKGMAIAL AVILCATWQ GFPMFKRGRC LCIGPGVKAV KVADIEKASI MYPSNNCDKI 60 
EVIITLKENK GQRCLNPKSK QARLIIKKVE RKNF 



120 
180 
240 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



Seq ID NO: 113 DNA sequence 
Nucleic Acid Accession #; NM_001110.1 
Coding sequence: 470.. 2716 

l n 21 31 41 51 

GAATTCGAGG ATCCGGGTAC CATGGGCGGC GGCAGGCCTA GCAGCACGGG AACCGTCCCC 60 

CGCGCGCATG CGCGCGCCCC TGAAGCGCCT GGGGGAOGGG TATGGGCGGG AGGTAGGGGC 12 
GCGGCTCCGC GTGCCAGTTG GGTGCCCGCG CGTCACGTGG TGAGGAAGGA GGCGGAGGTC 

TGAGTTTCGA GGGAGGGGGG GAGAGAAGAG GGAACGAGCA AGGGAAGGAA AGCGGGGAAA _ 

GGAGGAAGGA AACGAACGAG GGGGAGGGAG GTCCCTGTTT TGGAGGAGCT AGGAGCGTTG 300 

CCGGCCCCTG AAGTGGAGCG AGAGGGAGGT GCTTCGCCGT TTCTCCTGCC AGGGGAGGTC 360 

CCGGCTTCCC GTGGAGGCTC CGGACCAAGC CCCTTCAGCT TCTCCCTCCG GATCGATGTG 420 

CTGCTGTTAA CCCGTGAGGA GGCGGCGGGG GCGGCAGCGG CAGCGGAAGA TGGTGTTGCT 4 flu 
GAGAGTGTTA ATTCTGCTCC TCTCCTGGGC GGCGGGGATG GGAGGTCAGT ATGGGAATCC 
TTTAAATAAA TATATCAGAC ATTATGAAGG ATTATCTTAC AATGTGGATT CATTACACCA 
AAAACACCAG CGTGCCAAAA GAGCAGTCTC ACATGAAGAC CAATTTTTAC GTCTAGATTT 
CCATGCCCAT GGAAGACATT TCAACCTACG AATGAAGAGG GACACTTCCC TTTTCAGTGA 
TGAATTTAAA GTAGAAACAT CAAATAAAGT ACTTGATTAT GATACCTCTC ATATTTACAC 
TGGACATATT TATGGTGAAG AAGGAAGTTT TAGCCATGGG TCTGTTATTG ATGGAAGATT 
TGAAGGATTC ATCCAGACTC GTGGTGGCAC ATTTTATGTT GAGCCAGCAG AGAGATATAT 
TAAAGACCGA ACTCTGCCAT TTCACTCTGT CATTTATCAT GAAGATGATA TTAACTATCC 
CCATAAATAC GGTCCTCAGG GGGGCTGTGC AGATCATTCA GTATTTGAAA GAATGAGGAA 

ATACCAGATG ACTGGTGTAG AGGAAGTAAC ACAGATACCT CAAGAAGAAC ATGCTGCTAA 1080 

TGGTCCAGAA CTTCTGAGGA AAAAACGTAC AACTTCAGCT GAAAAAAATA CTTGTCAGCT 1140 

TTATATTCAG ACTGATCATT TGTTCTTTAA ATATTACGGA ACACGAGAAG CTGTGATTGC 1200 

CCAGATATCC AGTCATGTTA AAGCGATTGA TACAATTTAC CAGACCACAG ACTTCTCCGG 1260 

AATCCGTAAC ATCAGTTTCA TGGTGAAACG CATAAGAATC AATACAACTG CTGATGAGAA 1320 

GGACCCTACA AATCCTTTCC GTTTCCCAAA TATTGGTGTG GAGAAGTTTC TGGAATTGAA 1380 
TTCTGAGCAG AATCATGATG ACTACTGTTT GGCCTATGTC TTCACAGACC GAGATTTTGA 1440 
TGATGGCGTA CTTGGTCTGG CTTGGGTTGG AGCACCTTCA GGAAGCTCTG GAGGAATATG 1500 
TGAAAAAAGT AAACTCTATT CAGATGGTAA GAAGAAGTCC TTAAACACTG GAATTATTAC 1560 
TGTTCAGAAC TATGGGTCTC ATGTACCTCC CAAAGTCTCT CACATTACTT TTGCTCACGA 1620 
AGTTCGACAT AACTTTGGAT CCCCACATGA TTCTGGAACA GAGTGCACAC CAGGAGAATC 1680 
TAAGAATTTG GGTCAAAAAG AAAATGGCAA TTACATCATG TATGCAAGAG CAACATCTGG 1740 
GGACAAACTT AACAACAATA AATTCTCACT CTGTAGTATT AGAAATATAA GCCAAGTTCT 1800 
TGAGAAGAAG AGAAACAACT GTTTTGTTGA ATCTGGCCAA CCTATTTGTG GAAATGGAAT I860 
GGTAGAACAA GGTGAAGAAT GTGATTGTGG CTATAGTGAC CAGTGTAAAG ATGAATGCTG 1920 
CTTCGATGCA AATCAACCAG AGGGAAGAAA ATGCAAACTG AAACCTGGGA AACAGTGCAG 1980 
TCCAAGTCAA GGTCCTTGTT GTACAGCACA GTGTGCATTC AAGTCAAAGT CTGAGAAGTG 2040 
TCGGGATGAT TCAGACTGTG CAAGGGAAGG AATATGTAAT GGCTTCACAG CTCTCTGCCC 2100 
AGCATCTGAC CCTAAACCAA ACTTCACAGA CTGTAATAGG CATACACAAG TGTGCATTAA 2160 
TGGGCAATGT GCAGGTTCTA TCTGTGAGAA ATATGGCTTA GAGGAGTGTA CGTGTGCCAG 2220 
TTCTGATGGC AAAGATGATA AAGAATTATG CCATGTATGC TGTATGAAGA AAATGGACCC 2280 
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50 
55 
60 
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75 



ATCAACTTGT 
CACCCTGCAA 
GTGCAGATTA 
AGAGCTCTAT 
GGGAATTGCT 
AAGTAGTAAT 
ACCTCCACAG 
ACACATGAGA 
ACTTCACTCC 
CAGTTGAAGA 
TAAAGCCTGA 
AACAGATATT 
CCTGTTCTTC 
TACCACAAAT 
CACCTCAACA 
TTTCTTACGT 
TCTTTGTATT 
AGGATAACAG 
GTTTTGAAAT 



GCCAGTACAG 
CCTGGATCCC 
GTAGATGCTG 
GAAAACATTG 
CTGATCATGC 
CCAAAGTTGC 
CCCATTCAGC 
CGCTAACTGC 
AAAGAGAAAC 
AAAAATGGCA 
AAATTCCAAT 
TTTAACTTAA 
GTTGCTGCAT 
TGAAATAAAT 
TTGGAGACAT 
ACACTGTACT 
ATAAAATTTT 
AGAATGGTGG 
GAAAATCAAT 



GGTCTGTGCA 
CTTGCAACGA 
ATGGTCCTCT 
CTGAATGGAT 
TAATGGCTGG 
CTCCTCCTAA 
AACCCCAGCG 
AGCTTTTGCC 
CTATTAAGTC 
AGAGATCATA 
TTGGGGGTGG 
TGGCACAAAG 
TTTCTTCACT 
ATATTTTTTT 
CACTTGCCAA 
TCTGTGTGCA 
TCCGCTCTTA 
TATTCAGTGG 
TTACCCCATG 



GTGGAGTAGG 
TTTTAGAGGT 
AGCTAGGCTT 
TGTGGCTCAT 
ATTTATTAAG 
ACCACTTCCA 
TCAGCGGCCC 
TTGGTTCTTC 
ATCATCTCCA 
TCCTCAGACC 
GAGGTGGAAA 
TCTTAGAATA 
TGCAGGCAAA 
CAACTGCCAA 
TGTACATACC 
ATTGTAAACA 
ATTAAAAATT 
TCCAGGATTC 
GTACCCGGAT 



CACTTCAGTG 
TACTGTGATG 
AAAAAAGCAA 
TGGTGGGCAG 
ATATGCAGTG 
GGCACTTTAA 
CGAGAGAGTT 
CTAGTGCCTA 
AACTAAACCC 
AGGTGGAATT 
AGGAACCCAA 
TTATTATGTG 
CTTGGCTCTC 
TCAAGGCTAG 
TTGTTATATG 
GAAATTGCAA 
ACTGTTTAAT 
TGTAATGCTT 
CCTCGAATTC 



GTCGAACCAT 
TTTTCATGCG 
TTTTTAGTCC 
TATTACTTAT 
TTCATACTCC 
AGAGGAGGAG 
ATCAAATGGG 
CAATGGGAAA 
TCACAAGTAA 
ACTTAAATTT 
TTTTCTTATG 
CCCCGTGTTC 
AATAAACTTT 
GAGGCTCGAC 
CAGACATGTA 
TATGGATGTT 
TGACATACTC 
TACACAGGCA 



2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2620 
2880 
2940 
3000 
3060 
3120 
3160 
3240 
3300 
3360 



Seq ID NO: 114 Protein Sequence 
Protein Accession ft: np_001101.i 



1 

I 

MVLLRVLILL 
RLDFHAHGRH 
DGRFEGFIQT 
RMRKYQMTGV 
AVIAQISSHV 
LELNSEQNHD 
GIITVQNYGS 
ATSGDKLNNN 
DECCFDANQP 
ALCPASDPKP 
KMDPSTCAST 
IFSPELYENI 
KRRRPPQPIQ 



11 
i 

LSWAAGMGGQ 
FNLRMKRDTS 
RGGTFYVEPA 
EEVTQIPQEE 
KAIDTIYQTT 
DYCLAYVPTD 
HVPPKVSHIT 
KFSLCSIRNI 
EGRKCKLKPG 
NFTDCNRHTQ 
GSVQWSRHFS 
AEWIVAHWWA 
QPQRQRPRES 



21 

i 

YGNPLNKYIR 
LFSDEFKVET 
ERYIKDRTLP 
HAANGPELLR 
DFSGIRNISF 
RDFDDGVLGL 
FAHEVGHNFG 
SQVLEKKRNN 
KQCSPSQGPC 
VCINGQCAGS 
GRT1TLQPGS 
VLLMGIALIM 
YQMGHMRR 



31 
I 

HYEGLSYNVD 
SNKVLDYDTS 
FHSVIYHEDD 
KKRTTSAEKN 
MVKRIRINTT 
AWVGAPSGSS 
SPHDSGTECT 
CPVESGQPIC 
CTAQCAFKSK 
ICEKYGLEEC 
PCNDFRGYCD 
LMAGFIKICS 



41 
I 

SLHQKHQRAK 
HIYTGHIYGE 
INYPHXYGPQ 
TCQLYIQTDH 
ADEKDPTNPF 
GGICEKSKLY 
PGESKHLGQK 
GNGMVEQGEE 
SEKCRDDSDC 
TCASSDGKDD 
VPMRCRLVDA 
VHTPSSNPKL 



51 
1 

RAVSHEDQFL 
EGSFSHGSVI 
GGCADHSVFE 
LFFKYYGTRE 
RFPNIGVEKF 
SDGKKKSLNT 
ENGNYIMYAR 
CDCGYSDQCK 
AREGICNGPT 
KELCHVCCMK 
DGPLARLKKA 
PPPKPLPGTL 



Seq ID NO: 115 DNA sequence 

Nucleic Acid Accession ft: NM_000577.2 

Coding sequence: 41-520 



45 , 



GGCACGAGGG 
CCGACCCTCT 
GAAGACCTTC 
CAATTTAGAA 
CCATGGAGGG 
GGAGGCAGTT 
CATCCGCTCA 
CCTCTGCACA 
CGTCATGGTC 
TCCCATTCTT 
GCTATGGGGG 
CCTGGTCACA 
GTCTTTCTAA 
TCTGCATTCA 
CTTCCTCCCT 
ACCAAGTGGC 
TTTAAGGGTT 
GAAGGAGAGC 
ATTCCTGCAT 
TTTTTTTGTG 
ATTTTTTTTT 
CCCAGCCTCC 
TCCTATCAGA 
TCTTCCTCTG 
ACTTGTATGA 
GAGCAGGAAA 
CTCTTGGCAG 
CTGTGACTTC 
AAAAAAAAAA 



11 
I 

GAAGACCTCC 
GGGAGAAAAT 
TATCTGAGGA 
GAAAAGATAG 
AAGATGTGCC 
AACATCACTG 
GACAGTGGCC 
GCGATGGAAG 
ACCAAATTCT 
GCATGGCAAG 
CACTGAGGAC 
GGACTCTGCC 
TGTGTGAATC 
GGATCAAACC 
CATTCCACCT 
TCCCACACCC 
TGTGGAAAAT 
CCTTCATTTG 
TTGTGAAATG 
ATGTCCCAAC 
TCCTTTTAAA 
AAGCTCCATC 
AGTTTCTCAG 
CTGAAGGAAT 
AAGATGGCTG 
CATGACTCGT 
GTACTCAGCG 
AGCTCTGTTT 
AAAAAAAAAA 



21 
I 

TGTCCTATCA 
CCAGCAAGAT 
ACAACCAACT 
ATGTGGTACC 
TGTCCTGTGT 
ACCTGAGCGA 
CCACCACCAG 
CTGACCAGCC 
ACTTCCAGGA 
GACTGCAGGG 
CAGCCATTGA 
TCCTCTTCAA 
AGAGCACAGC 
CCGACCACCT 
TCCCATGCCC 
TGTTTTACAA 
GAAAATTAGG 
GAGATTATGT 
ATGGTGAAAG 
TTGTAAAAAT 
ACACTTCCAT 
TCCACTCCAG 
CTCCCAAGGC 
AAATTGCTCC 
TGCCTCTGCC 
ATATGTCTCA 
AATGAATGCT 
TACAATAAAA 
AAAAAAAAAA 



31 
I 

GGCCCTCCCC 
GCAAGCCTTC 
AGTTGCCGGA 
CATTGAGCCT 
CAAGTCTGGT 
GAACAGAAAG 
TTTTGAGTCT 
CGTCAGCCTC 
GGACGAGTAG 
ACTGCCAGTC 
GGGGTGGACC 
CTGACCAGCC 
AGCCCCTGCA 
GCCCAACCTG 
TGGATCCATC 
AAAAGAAAAG 
ATTTCATGAT 
TCTTTCGGGG 
TAAGTGGTAG 
TAAAAGTTAT 
AATCTGGACT 
ATTTTTTACA 
TCTGAGCAAA 
TTGACATTGT 
TGTCTCCCCC 
GGTCCCTGCA 
GTATATGTTG 
TCTTGAAAAT 
AAAAAA 



41 
I 

ATGGCTTTAG 
AGAATCTGGG 
TACTTGCAAG 
CATGCTCTGT 
GATGAGACCA 
CAGGACAAGC 
GCCGCCTGCC 
ACCAATATGC 
TACTGCCCAG 
CCCCTGCCCC 
CTCAGAAGGC 
TCCATGCTGC 
CAAAGCCCTT 
CTCTCCTCTT 
AGGCCACTTG 
ACCAGTCCAT 
T TTT T T T TTT 
AGAGGCTGAG 
CTTTTCCCTT 
GGTACTATGT 
CCTCTGTCCA 
GCTGCCTGCA 
TGTGGCTCCT 
AGAGCTTCTG 
ACCAGGCTGG 
GGGCCAAGCA 
GGTGCAAAGT 
GCCTAAAAAA 



60 

120 ' 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 



51 
1 

AGACGATCTG 
ATGTTAACCA 
GACCAAATGT 
TCTTGGGAAT 
GACTCCAGCT 
GCTTCGCCTT 
CCGGTTGGTT 
CTGACGAAGG 
GCCTGCCTGT 
AGGGCTCCCG 
GTCACAACAA 
CTCCAGAATG 
CCATGTCGCC 
GCCACTGCCT 
ATGACCCCCA 
GAGGGAGGTT 
CAGTCCCCGT 
GACTTAAAAT 
CTTTTTCTTC 
TAGCCCCATA 
GGCACTGCTG 
GTACTTTACC 
GGGGGTTCTT 
GCACTTGGAG 
GAGCTCTGCA 
CCTAGCCTCG 
TCCCTACTTC 
AAAAAAAAAA 



60 

120 

180 

240 

300 

3 60 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 



Seq ID NO: 116 Protein Sequence 
Protein Accession ft: NP_0G0568.1 
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MALETICRPS GRKSSKMQAF RIWDVNQKTP YLRNNQLVAG YLQGPNVNLE EKIDWPIEP 60 
HALPLGIHGG KMCLSCVKSG DETRLQLEAV NITDLSENRK QDKRFAFIRS DSGPTTSFES 120 
AACPGWFLCT AMEADQPVSL TNMPDEGVMV TKFYPQEDE 



Seq ID NO: 117 UNA sequence 

Nucleic Acid Accession 8: CAT cluster 

1 11 21 31 41 51 

I 1 I I I I 

GCACTTTAGC TCAGTGATGG ATTTTAAGAA CGATTACACC TGTCGCCTGT GGTCTGACTC 60 

CAGGCACTCG CGTCAGGTAC TTCTGCTCCA GGATAGCTTT ATGAATTGCT CTGACAGCAT 120 

CATCAATGGT TCCTTTCGTG CGCTTGGCTT TATTCATGAG GCTCAGGTCG GGGAAAGACT 180 

GATGGTCCAC TGTGACAGCA AGACAGGTAA TGCAAATACG GATTTCATCT GGGTGGGTCC 240 

AGATAACAGA CTGCTAGAGC CGGATAAAGA GATGGAAAAC TTTTACGTGT TTCACAATGG 300 

AAGTCTGGTT ATAGAAAGCC CTCGTTTTGA GGATGCTGGA GTGTATTCTT GTATCGCAAT 360 

GAATAAGCAA CGCCTGTTAA ATGAAACTGT GGACGTCACA ATAAATGTGA GCAATTTCAC 420 

TGTAAGCAGA TCCCATGCTC ATGAGGCATT TAACACAGCT TTTACCACTC TTGCTGCTTG 480 

CGTGGCCAGT ATCGTTTTGG TACTTTTGTA CCTCTATCTG ACTCCATGCC CCTGCAAGTG 540 

TAAAACCAAG AGACAGAAAA ATATGCTACA CCAAAGCAAT GCCCATTCAT CGATTCTCAG 600 

TCCTGGCCCC GCTAGTGATG CCTCCGCTGA TGAACGGAAG GCAGGTGCAG GTAAAAGAGT 660 

GGTGTTTTTG GAACCCCTGA AGGATACTGC AGCAGGGCAG AACGGGAAAG TCAGGCTCTT 720 

TCCCAGCGAG GCAGTGATAG CTGAGGGCAT CCTAAAGTCC ACGAGGGGGA AATCTGACTC 780 
AGATTCAGTC AATTCAGTGT TTTCTGACAC ACCTTTTGTG GCGTCCACTT AA 

Seq ID NO: 118 DNA sequence 

Nucleic Acid Accession tt: AB032989.1 

Coding sequence : 1 . . 1316 

1 11 21 31 41 51 

CCCATTCCTT GCCCAGTTAC ACAGCACTAC TGGACCTCAG TCACAACAAC CTGAGCCGCC 60 

TGCGGGCCGA GTGGACCCCC ACGCGCCTGA CCCAACTGCA CTCCCTGCTG CTGAGCCACA 120 

ACCACCTGAA CTTCATCTCC TCTGAGGCCT TTTCCCCGGT ACCCAACCTG CGCTACCTGG 180 

ACCTCTCCTC CAACCAGCTG CGTACACTGG ATGAGTTCCT GTTCAGTGAC CTGCAAGTAC 240 

TGGAGGTGCT GCTGCTCTAC AATAACCACA TCATGGCGGT GGACCGGTGC GCCTTCGATG 300 

ACATGGCCCA GCTGCAGAAA CTCTACTTGA GCCAGAACCA GATCTCTCGC TTCCCTCTGG 360 

AACTGGTCAA GGAAGGAGCC AAGCTACCCA AACTAACGCT CCTGGATCTC TCTTCTAACA 420 

AGCTGAAGAA CTTGCCATTG CCTGACCTGC AGAAGCTGCC GGCCTGGATC AAGAATGGGC 480 

TGTACCTACA TAACAACCCC CTGAACTGCG ACTGTGAGCT CTACCAGCTG TTTTCACACT 540 

GGCAGTATCG GCAGCTGAGC TCCGTGATGG ACTTTCAAGA GGATCTGTAC TGCATGAACT 600 

CCAAGAAGCT GCACAATGTC TTCAACCTGA GTTTCCTCAA CTGTGGCGAG TACAAGGAGC 660 

GTGCCTGGGA GGCCCACCTG GGTGACACCT TGATCATCAA GTGTGACACC AAGCAGCAAG 720 

GGATGACCAA GGTGTGGGTG ACACCAAGTA ATGAACGGGT GCTAGATGAG GTGACCAATG 780 

GCACAGTGAG TGTGTCTAAG GATGGCAGTC TTCTTTTCCA GCAGGTGCAG GTCGAGGACG 840 

GTGGTGTGTA TACCTGCTAT GCCATGGGAG AGACTTTCAA TGAGACACTG TCTGTGGAAT 900 

TGAAAGTGCA CAATTTCACC TTGCAOGGAC ACCATGACAC CCTCAACACA GCCTATACCA 960 
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CTCCATCGAG AGGTCTTTGC AGTTGTGGTT GTGAGGCCAG AGGAACTTAA GAAGAATGAG 2760 

CTCAACATTC CATTCCATCC AAGGTTGCAG AATACTTTGA GTATGGAGAT GACCGATCCC 2820 

TTTAACCTTT CCCTTTGACT ATACTGCATG CCCTGTGAGT GGCTGCTCTG AGCTTTGGCG 2880 

GGGGATGAGC AGGGTGAAGG TGAAAATGTT CTCTGATGTT GGTTTTTCTT AGTGTCTCTT 2940 

5 CTCTCCCACG TTTCTAGTCC ATTGGAAGTT CTCCAGATCC TCTAGATTTC TGACCTCTTT 3000 

TTACATTCTA AGAGGTGTCA TTTACCTGTG GGTGAGGGGC TTGGAGGGCC TGACTATTAA 3060 

GCTTCCTATT ATGATTGGGG AAGGGGACTC AACATCTTCC CCTTGTCTGG TCCTGGGAAC 3120 

TCAAGACTGC TGTCCATCCT TTATTAGTAA CCTAGTTTTG TTGAGAGAGA GGTGCAGCAT 3180 

TTCCCTCAAG TACCTGCATG CTCATAGGTA TGCATGCTTA TAGGTATGCA TGCACACCAG 3240 

10 CACCTTGACT TCCACCAGGA CAGTGAATCT GTAGTCCCCA TTAATGATAC TCTCAGAATG 3300 

GTCTCCACCA ATGTGCACAA CTTCATGCAC ACAATTGTAT TCCTCTTTAG GAGAGACAGT 3360 

GACCATGCCA GGCTCTTCAA ATACACACTC CTGTATTTTC ATTTCTGAAC TCCCTGAAGC 3420 

TTCACCCACA TCTCCATGGC ACTGAATCAC TTTTCTGCCC AGGCTGGAGT TAGAAGAACC 3480 

ATGGGTTGAA GGCCTTGGGA GAAGAGGTGG TAAGGAGCTA CCTGGTTCTT TGGACTCCTC 3540 

15 AGAAACCTCC CGTGAAGACT ATATATATGA GCAATATATG AGCAATTAAG ATAGAACTAG 3600 

GTCATTGAAG GACTCAGAAG GAAACTGGCA AGGGGCTAAG TAGCTGTGAG CTGCTCTTCC 3660 

TGTTAACTGT CCTCAGGAGA GAAAAGTTGT CTTGATTTAT TCCAGAGACT CCTGAGAATT 3720 

TTGTTACTGG AGCTATAGGA ACTAGGACTC CCTCCTCAGA CTCATCAGGA AATCTG GGAA 3780 

AATCTCATCT CCATGCTGCA GTCTCCCCGC TTGGCACCAT CTAGGGCTCT GAAGATCTTT 3840 

20 AAAGTGAGCG TTAAATGAAG CAGGCCTTCT GGGGCATCGT GATGGAGCTC CTGAGGACTG 3900 

GGAGGGCCGT TTTCTTCATA ATTCCTACCC AGTTTAGGGA ATGAGAAAAG GCCCTGTGGC 3960 

TCTTAAGTCT CCTGACTATA TCCACTAGTT TGGCCTTGAG TTTGAGACAC CTGGTAGATG 4020 

TACCTGGCTG CCCTACCAAG AGATTTGATT TGTTTCTTCC CGTGGTTTCA GGTGATAGAA 4080 

CCCAAAGGTA GCTTTTTAAT TGGAGAATTG AATAATCAGC ATATGTAAGC GCACTAGAAC 4140 

25 CCTGTGTTGA AAACTGCCAG GTGTGGGTAT AAGAAAAGGC CGAGAGATCA CCACCTCCTC 4200 

TTACCCTACC CCACACCCAT AAACCAGACA TGTCTCCCAG GAAGCAGGTG TCCCTGGACA 4260 

CAGAGGATGA CAGGGCTCTA CAATCTGTGT AATTATTATT ATTATTATTA TTTTGTATTT 4320 

ATGGGGCCCA AGAAAGGGGC CAGGAGAGGG TACACCCCAG CTGGGGAGAG CAAAGCAGAT 4380 

GGATCCAGTT TTCTGTGTGT TCTTACCTCT GTACTTCCTC GTAGCTCTGC TGACAAAGCA 4440 

30 AGCAGGCCTC CCGTGTCCAA GACCCCATTC CTCCCACTTG TGTACACCTA GGCTGGCAAA 4500 

TCTTGGAGCC TCTGGGCTCT GAAAACTAGA CAATGATCAT TAAACCTGGC TTGAGTCTCT 4560 
GTTCTGGC 

Seq ID NO: 119 Protein Sequence 
35 Protein Accession #: BAA86477.1 

1 11 21 31 41 51 

I I I I I I 

HSLPSYTALL DLSHNNLSRL RAEWTPTRLT QLHSLLLSHN HLNFISSEAF SPVPNLRYLD 60 

40 LSSNQLRTLD EFLFSDLQVL EVLLLYNNH I MAVDRCAFDD MAQLQKLYLS QNQISRFPLE 120 

LVKEGAKLPK LTLLDLSSNK LKNLPLPDLQ KLPAWIKNGL YLHNNPLNCD CELYQLFSHW 180 

QYRQLSSVMD FQEDLYCMNS KKLHNVFNLS FLNCGEYKER AWEAHLGDTL IIKCDTKOQG 240 

MTKVWVTPSN ERVLDEVTNG TVSVSKDGSL LFQQVQVEDG GVYTCYAMGE TFKETLSVEL 300 

KVHNFTLHGH HDTLNTAYTT LVGCILSWL VLIYLYLTPC RCWCRGVEKP SSHQGDSLSS 360 

45 SMLSTTPNHD PMAGGDKDDG FDRRVAFLEP AGPGQGQNGK LKPGNTLPVP EATGKGQRRM 420 
SDPESVSSVF SDTPIW 

Seq ID NO: 120 DNA sequence 
Nucleic Acid Accession #: NM_005071 
50 Coding sequence: 9.. 1703 

1 11 21 31 41 51 

! I I I I I 

GATAGACCAT GAGCAGCCAT GGCAACAGCC TGTTCCTTCG GGAGAGCGGC CAGCGGCTGG 60 

55 GCCGGGTGGG CTGGCTGCAG CGGCTGCAGG AAAGCCTGCA GCAGAGAGCA CTGCGCACGC 120 

GCCTGCGCCT GCAGACCATG ACCCTCGAGC ACGTGCTGCG CTTCCTGCGC CGAAACGCCT 180 

TCATTCTGCT GACGGTCAGC GCCGTGGTCA TTGGGGTCAG CCTGGCCTTT GCCCTGCGCC 240 

CATATCAGCT CACCTACCGC CAGATCAAGT ACTTCTCTTT TCCTGGAGAG CTTCTGATGA 300 

GGATGCTGCA GATGCTGGTG TTACCTCTCA TTGTCTCCAG CCTGGTCACA GGTATGGCAT 360 

60 CCCTGGACAA CAAGGCCACG GGGCGGATGG GGATGCGGGC AGCTGTGTAC TACATGGTGA 420 

CCACCATCAT CGCGGTCTTC ATCGGCATCC TCATGGTCAC CATCATCCAT CCCGGGAAGG 480 

GCTCCAAGGA GGGGCTGCAC CGGGAGGGCC GGATCGAGAC CATCCCCACA GCTGATGCCT 540 

TCATGGACCT GATCAGAAAT ATGTTTCCAC CAAACCTTGT GGAGGCCTGC TTCAAACAGT 600 

TCAAGACGCA GTACAGCACG AGGGTGGTAA CCAGGACCAT GGTGAGGACA GAGAACGGGT 660 

65 CTGAGCCGGG TGCCTCCATG CCTCCTCCAT TCTCAGTGGA GAACGGAACC AGCTTCCTGG 720 

AAAATGTCAC TCGGGCCTTG GGTACCCTGC AGGAGATGCT GAGCTTTGAG GAGACTGTAC 780 

CCGTGCCTGG CTCCGCCAAT GGCATCAACG CCCTGGGCCT CGTGGTCTTC TCTGTGGCCT 840 

TTGGGCTGGT CATTGGTGGC ATGAAACACA AGGGCAGAGT CCTCAGGGAC TTCTTCGACA 900 
GCCTCAATGA GGCTATTATG AGGCTGGTGG GCATCATTAT CTGGTATGCA CCTGTGGGCA 960 

70 TCCTGTTCCT GATTGCTGGG AAGATTCTGG AGATGGAAGA CATGGCCGTC CTGGGGGGTC 1020 

AGCTGGGCAT GTACACCCTG ACCGTCATCG TGGGCCTGTT CCTCCATGCC GGCATTGTCC 1080 
TTCCCCTCAT CTACTTCCTC GTCACTCACC GGAACCCCTT CCCCTTCATT GGGGGCATGC 1140 
TACAAGCCCT CATCACCGCT ATGGGCACGT CTTCCAGCTC GGCAACGCTG CCCATCACCT 1200 
TCCGCTGCCT GGAGGAGGGC CTGGGTGTGG ACCGCCGCAT CACCAGGTTC GTCCTGCCCG 1260 

75 TGGGCGCCAC GGTCAACATG GATGGCACTG CCCTCTACGA GGCCCTGGCT GCCATCTTCA 1320 
TTGCTCAAGT TAACAACTAC GAGCTCAACC TGGGTCAGAT CACAACCATC AGCATCACGG 1380 
CCACAGCAGC CAGTGTTGGG GCTGCTGGCA TCCCCCAGGC GGGTCTGGTC ACCATGGTCA 1440 
TTGTGCTTAC GTCGGTCGGC TTGCCCACGG AAGACATCAC GCTCATCATC GCCGTGGACT 1S00 
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5 



GGTTCCTTGA CCGGCTTCGC ACAATGACCA ACGTACTGGG GGACTCAATT GGAGCGGCCG 1560 

TCATCGAGCA CTTGTCTCAG CGGGAGCTGG AGCTTCAGGA AGCTGAGCTT ACCCTCCCCA 1620 

GCCTGGGGAA ACCCTACAAG TCCCTCATGG CACAGGAGAA GGGGGCATCC CGGGGACGGG 1680 
GAGGCAACGA GAGTGCTATG TGAGGGGCCT CCAGCTCTG 

Seq ID NO: 121 Protein Sequence 
Protein Accession ti t NPJJ05062 

1 11 21 31 41 51 

10 | | I I II 

MSSHGNSLFL RESGQRIiGRV GWLQRLQESL QQRALRTRLR LQTMTLEHVL RFLRRNAFIL 60 

LTVSAWIGV SLAFAIjRPYQ LTYRQIKYFS FPGBLIiMRML QMLVLPLIVS SLVTGMASLD 120 

NKATGRMGMR AAVYYMVTTI IAVFIGILMV TIIHPGKGSK EGLHREGRIE TIPTADAFMD 180 

LIRNMFPPNL VEACFKQFKT QYSTRWTRT MVRTENGSEP GASMPPPFSV ENGTSFLENV 240 

15 TRALGTLQEM LSFEETVPVP GSANGINALG LWFSVAFGL VIGGMKHKGR VLRDFFDSLN 300 

EAIMRLVGII IWYAPVGIIjF LIAGKILEME DMAVLGGQLG MYTLTVIVGL FLHAGIVLPL 360 

IYFLVTHRNP FPFIGGMLQA LITAMGTSSS SATLPITPRC LEEGLGVDRR ITRFVLPVGA 420 

TVNMDGTALY EALAAIFIAQ VNNYELNLGQ ITTISITATA ASVGAAGIPQ AGLVTMVIVL 480 

TSVGLPTEDI TLIIAVDWFL DRLRTMTNVL GDSIGAAVIE HLSQRELELQ EAELTLPSLG 540 
20 KPYKSLMAQE KGASRGRGGN ESAM 



25 



55 



70 



Seq ID NO; 122 DNA sequence 

Nucleic Acid Accession #: NM_031966.1 

Coding sequence: 178.. 1479 



Seq ID NO.« 123 Protein Sequence 
protein Accession ih NPJL14172.1 



1 11 21 31 41 51 

| I I I I I 

ACGAACAGGC CAATAAGGAG GGAGCAGTGC GGGGTTTAAA TCTGAGGCTA GGCTGGCTCT 60 

TCTCGGCGTG CTGCGGCGGA ACGGCTGTTG GTTTCTGCTG GTTGTAGGTC CTTGGCTGGT 120 

30 CGGGCCTCCG GTGTTCTGCT TCTCCCCGCT GAGCTGCTGC CTGGTGAAGA GGAAGCCATG 180 

GCGCTCCGAG TCACCAGGAA CTCGAAAATT AATGCTGAAA ATAAGGCGAA GATCAACATG 240 

GCAGGCGCAA AGCGCGTTCC TACGGCCCCT GCTGCAACCT CCAAGCCCGG ACTGAGGCCA 300 

AGAACAGCTC TTGGGGACAT TGGTAACAAA GTCAGTGAAC AACTGCAGGC CAAAATGCCT 360 

ATGAAGAAGG AAGCAAAACC TTCAGCTACT GGAAAAGTCA TTGATAAAAA ACTACCAAAA 420 

35 CCTCTTGAAA AGGTACCTAT GCTGGTGCCA GTGCCAGTGT CTGAGCCAGT GCCAGAGCCA 480 

GAACCTGAGC CAGAACCTGA GCCTGTTAAA GAAGAAAAAC TTTCGCCTGA GCCTATTTTG 540 

GTTGATACTG CCTCTCCAAG CCCAATGGAA ACATCTGGAT GTGCCCCTGC AGAAGAAGAC 600 

CTGTGTCAGG CTTTCTCTGA TGTAATTCTT GCAGTAAATG ATGTGGATGC AGAAGATGGA 660 

GCTGATCCAA ACCTTTGTAG TGAATATGTG AAAGATATTT ATGCTTATCT GAGACAACTT 720 

40 GAGGAAGAGC AAGCAGTCAG ACCAAAATAC CTACTGGGTC GGGAAGTCAC TGGAAACATG 780 

AGAGCCATCC TAATTGACTG GCTAGTACAG GTTCAAATGA AATTCAGGTT GTTGCAGGAG 840 

ACCATGTACA TGACTGTCTC CATTATTGAT CGGTTCATGC AGAATAATTG TGTGCCCAAG 900 

AAGATGCTGC AGCTGGTTGG TGTCACTGCC ATGTTTATTG CAAGCAAATA TGAAGAAATG 960 

TACCCTCCAG AAATTGGTGA CTTTGCTTTT GTGACTGACA ACACTTATAC TAAGCACCAA 1020 

45 ATCAGACAGA TGGAAATGAA GATTCTAAGA GCTTTAAACT TTGGTCTGGG TCGGCCTCTA 1080 

CCTTTGCACT TCCTTCGGAG AGCATCTAAG ATTGGAGAGG TTGATGTCGA GCAACATACT 1140 

TTGGCCAAAT ACCTGATGGA ACTAACTATG TTGGACTATG ACATGGTGCA CTTTCCTCCT 1200 

TCTCAAATTG CAGCAGGAGC TTTTTGCTTA GCACTGAAAA TTCTGGATAA TGGTGAATGG 1260 

ACACCAACTC TACAACATTA CCTGTCATAT ACTGAAGAAT CTCTTCTTCC AGTTATGCAG 1320 

50 CACCTGGCTA AGAATGTAGT CATGGTAAAT CAAGGACTTA CAAAGCACAT GACTGTCAAG 1380 

AACAAGTATG CCACATCGAA GCATGCTAAG ATCAGCACTC TACCACAGCT GAATTCTGCA 1440 

CTAGTTCAAG ATTTAGCCAA GGCTGTGGCA AAGGTGTAAC TTGTAAACTT GAGTTGGAGT 1500 

ACTATATTTA CAAATAAAAT TGGCACCATG TGCCATCTGT AAAAAAAAAA AAAAAAAAAA 1560 
AAAAAAAAAA AAAAAAAA 



1 li 21 31 41 51 

60 | i I I II 

MALRVTRNSK INABNKAKIN MAGAKRVPTA PAATSKPGLR PRTALGDIGN KVSEQLQAKM 60 

PMKKEAKPSA TGKVIDKKLP KPLEKVPMLV PVPVSEPVPE PEPEPEPEPV KEEKLSPEPI 120 

LVDTASPSPM ETSGCAPAEE DLCQAFSDVI LAVNDVDAED GADPNLCSEY VKDIYAYLRQ 180 

LEEEQAVRPK YLIiGREVTGN MRAILIDWLV QVQMKFRLLQ ETMYMTVSII DRFNQNNCVP 240 

65 KKMLQLVGVT AMPIASKYEE MYPPEIGDFA FVTDNTYTKH QIRQMEMKIL RALNFGLGRP 300 

IiPLHPLRRAS KIGEVDVEQH TLAKYLMELT MLDYDMVHFP PSQIAAGAFC LALKILDNGE 360 

WTPTLQHYLS YTEESLLPVM QHIAKNWMV NQGLTKHMTV KNKYATSKHA KISTLPQLNS 420 
ALVQDLAKAV AKV 



Seq ID NO: 124 DNA sequence 

Nucleic Acid Accession 8: Eos sequence 

Coding sequence: 13-1424 



75 1 11 21 31 41 SI 

TAGAAGTTTA CAATGAAGTT TCTTCTAATA CTGCTCCTGC AGGCCACTGC TTCTGGAGCT 60 
CTTCCCCTGA ACAGCTCTAC AAGCCTGGAA AAAAATAATG TGCTATTTGG TGAAAGATAC 120 
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TTAGAAAAAT TTTATGGCCT TGAGATAAAC AAACTTCCAG TGACAAAAAT GAAATATAGT 180 

GGAAACTTAA TGAAGGAAAA AATCCAAGAA ATGCAGCACT TCTTGGGTCT GAAAGTGACC 240 

GGGCAACTGG ACACATCTAC CCTGGAGATG ATGCACGCAC CTCGATGTGG AGTCCCCGAT 300 

GTCCATCATT TCAGGGAAAT GCCAGGGGGG CCCGTATGGA GGAAACATTA TATCACCTAC 360 

AGAATCAATA ATTACACACC TGACATGAAC CGTGAGGATG TTGACTACGC AATCCGGAAA 420 

GCTTTCCAAG TATGGAGTAA TGTTACCCCC TTGAAATTCA GCAAGATTAA CACAGGCATG 480 

GCTGACATTT TGGTGGTTTT TGCCCGTGGA GCTCATGGAG ACTTCCATGC TTTTGATGGC 540 

AAAGGTGGAA TCCTAGCCCA TGCTTTTGGA CCTGGATCTG GCATTGGAGG GGATGCACAT 600 

TTCGATGAGG ACGAATTCTG GACTACACAT TCAGGAGGCA CAAACTTGTT CCTCACTGCT 660 

GTTCACGAGA TTGGCCATTC CTTAGGTCTT GGCCATTCTA GTGATCCAAA GGCCGTAATG "720 

TTCCCCACCT ACAAATATGT TGACATCAAC ACATTTCGCC TCTCTGCTGA TGACATACGT 780 

GGCATTCAGT CCCTGTATGG AGACCCAAAA GAGAACCAAC GCTTGCCAAA TCCTGACAAT 840 

TCAGAACCAG CTCTCTGTGA CCCCAATTTG AGTTTTGATG CTGTCACTAC CGTGGGAAAT 900 

AAGATCTTTT TCTTCAAAGA CAGGTTCTTC TGGCTGAAGG TTTCTGAGAG ACCAAAGACC 960 

AGTGTTAATT TAATTTCTTC CTTATGGCCA ACCTTGCCAT CTGGCATTGA AGCTGCTTAT 1020 

GAAATTGAAG CCAGAAATCA AGTTTTTCTT TTTAAAGATG ACAAATACTG GTTAATTAGC 1080 

AATTTAAGAC CAGAGCCAAA TTATCCCAAG AGCATACATT CTTTTGGTTT TCCTAACTTT 1140 

GTGAAAAAAA TTGATGCAGC TGTTTTTAAC CCACGTTTTT ATAGGACCTA CTTCTTTGTA 1200 

GATAACCAGT ATTGGAGGTA TGATGAAAGG AGACAGATGA TGGACCCTGG TTATCCCAAA 1260 

CTGATTACCA AGAACTTCCA AGGAATCGGG CCTAAAATTG ATGCAGTCTT CTACTCTAAA 1320 

AACAAATACT ACTATTTCTT CCAAGGATCT AACCAATTTG AATATGACTT CCTACTCCAA 13 BO 

CGTATCACCA AAACACTGAA AAGCAATAGC TGGTTTGGTT GTTGAAAATG GTGTAATTAA 1440 

TGGTTTTTGT TAGTTCACTT CAGCTTAATA AG TAT TT ATT GCATATTTGC TATGTCCTCA 1500 

GTGTACCACT ACTTAGAGAT ATGTATCATA AAAATAAAAT CTGTAAACCA TAGGTAATGA 1560 

TTATATAAAA TACATAATAT TTTTCAATTT TGAAAACTCT AATTGTCCAT TCTTGCTTGA 1620 

CTCTACTATT AAGTTTGAAA ATAGTTACCT TCAAAGCAAG ATAATTCTAT TTGAAGCATG 1680 

CTCTGTAAGT TGCTTCCTAA CATCCTTGGA CTGAGAAATT ATACTTACTT CTGGCATAAC 1740 
TAAAATTAAG TATATATATT TTGGCTCAAA TAAAATTG 

Seg ID NO: 125 Protein sequence: 
Protein Accession ft: Eos sequence 

1 11 21 31 41 51 

I I I I I I 

MKFLLILLLQ ATASGAIiPLN SSTSIiEKNNV LPGERYLEKF YGLEINKLPV TKMKYSGNLM 60 

KEKIQEMQHF LGLKVTGQLD TSTLEMMHAP RCGVPDVHHF REMPGGPVWR KHYITYRINN 120 

YTPDMNREDV DYAIRKAPQV WSNVTPLKFS KINTGMADIL WFARGAHGD FHAFDGKGGI 180 

liAHAFGPGSG IGGDAHFDED EFWTTHSGGT NLFLTAVHEI GHSLGLGHSS DPXAVMPPTY 240 

KYVDINTFRL SADDIRGIQS LYGDPKENQR LPNPDNSEPA LCDPNLSFDA VTTVGNKIFF 300 

FKDRFFWLKV SERPKTSVNL ISSLWPTLPS GIEAAYEIEA RNQVFLPKDD KYWLISNLRP 360 

EPNYPKSIHS FGFPNFVKKI DAAVFNPRFY RTYFFVDNQY WRYDERRQMM DPGYPKLITK 420 
NFQGIGPKID AVFYSKNKYY YFFQGSNQFE YDFLLQRITK TLKSNSWFGC 

Seq ID NO: 126 DNA sequence 

Nucleic Acid Accession ft: NM_000077.2 

Coding sequence t 2 72.. 742 

1 11 21 31 41 51 

| | I I I I 

CCCAACCTGG GGCGACTTCA GGTGTGCCAC ATTCGCTAAG TGCTCGGAGT TAATAGCACC 60 

TCCTCCGAGC ACTCGCTCAC GGCGTCCCCT TGCCTGGAAA GATACCGCGG TOCCTCCAGA 120 

GGATTTGAGG GACAGGGTCG GAGGGGGCTC TTCCGCCAGC ACCGGAGGAA GAAAGAGGAG 180 

GGGCTGGCTG GTCACCAGAG GGTGGGGCGG ACCGCGTGCG CTCGGCGGCT GCGGAGAGGG 240 

GGAGAGCAGG CAGCGGGCGG CGGGGAGCAG CATGGAGCCG GCGGCGGGGA GCAGCATGGA 300 

GCCTTCGGCT GACTGGCTGG CCACGGCCGC GGCCCGGGGT CGGGTAGAGG AGGTGCGGGC 360 

GCTGCTGGAG GCGGGGGCGC TGCCCAACGC ACCGAATAGT TACGGTCGGA GGCCGATCCA 420 

GGTCATGATG ATGGGCAGCG CCCGAGTGGC GGAGCTGCTG CTGCTCCACG GCGCGGAGCC 4 80 

CAACTGCGCC GACCCCGCCA CTCTCACCCG ACCCQTGCAC GACGCTGCCC GGGAGGGCTT 540 

CCTGGACACG CTGGTGGTGC TGCACCGGGC CGGGGCGCGG CTGGACGTGC GCGATGCCTG 600 

GGGCCGTCTG CCCGTGGACC TGGCTGAGGA GCTGGGCCAT CGCGATGTCG CACGGTACCT 660 

GCGCGCGGCT GCGGGGGGCA CCAGAGGCAG TAACCATGCC CGCATAGATG CCGCGGAAGG 720 

TCCCTCAGAC ATCCCCGATT GAAAGAACCA GAGAGGCTCT GAGAAACCTC GGGAAACTTA 7 B0 

GATCATCAGT CACOGAAGGT CCTACAGGGC CACAACTGCC CCCGCCACAA CCCACCCCGC 840 

TTTCGTAGTT TTCATTTAGA AAATAGAGCT TTTAAAAATG T CCTGC CTTT TAACGTAGAT 900 

ATATGCCTTC CCCCACTACC GTAAATGTCC ATTTATATCA TTTTTTATAT ATTCTTATAA 960 

AAATGTAAAA AAGAAAAACA CCGCTTCTGC CTTTTCACTG TGTTGGAGTT TTCTGGAGTG 1020 

AGCACTCACG CCCTAAGCGC ACATTCATGT GGGCATTTCT TGCGAGCCTC GCAGCCTCCG 10 B0 

GAAGCTGTCG ACTTCATGAC AAGCATTTTG TGAACTAGGG AAGCTCAGGG GGGTTACTGG 1140 

CTTCTCTTGA GTCACACTGC TAGCAAATGG CAGAACCAAA GCTCAAATAA AAATAAAATA 1200 
ATTTTCATTC ATTCACTC 

Seq ID NO: 127 Protein Sequence 
Protein Accession ft: NP_000068.1 

1 11 21 31 41 51 

| | I I f I 

MEPAAGSSME PSADWLATAA ARGRVEEVRA LLEAGALPNA PNSYGRRPIQ VMMMGSARVA 60 

ELLL.LHGAEP NCADPATLTR PVHDAAREGF LDTLWLHRA GARLDVRDAW GRLPVDLAEE 120 
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LGHRDVARYL RAAAGGTRGS NHARIDAAEG PSDIPD 

Seq ID NO: 128 DNA sequence 
Nucleic Acid Accession »: NM_058196.1 
5 Ceding sequence: 104.. 421 

1 11 21 31 41 51 

I I I I I I 

TGTGTGGGGG TCTGCTTGGC GGTGAGGGGG CTCTACACAA GCTTCCTTTC CGTCATGCCG 60 

10 GCCCCCACCC TGGCTCTGAC CATTCTGTTC TCTCTGGCAG GTCATGATGA TGGGCAGCGC 120 

CCGAGTGGCG GAGCTGCTGC TGCTCCACGG CGCGGAGCCC AACTGCGCCG ACCCCGCCAC 180 

TCTCACCCGA CCCGTGCACG ACGCTGCCCG GGAGGGCTTC CTGGACACGC TGGTGGTGCT 240 

GCACCGGGCC GGGGCGCGGC TGGACGTGCG CGATGCCTGG GGCCGTCTGC CCGTGGACCT 300 

GGCTGAGGAG CTGGGCCATC GCGATGTCGC AGGGTACCTG CGCGCGGCTG CGGGGGGCAC 360 

15 CAGAGGCAGT AACCATGCCC GCATAGATGC CGCGGAAGGT CCCTCAGACA TCCCCGATTG 420 

AAAGAACCAG AGAGGCTCTG AGAAACCTCG GGAAACTTAG ATCATCAGTC ACCGAAGGTC 4 80 

CTACAGGGCC ACAACTGCCC CCGCCACAAC CCACCCCGCT TTCGTAGTTT TCATTTAGAA 540 

AATAGAGCTT TTAAAAATGT CCTGCCTTTT AACGTAGATA TAAGCCTTCC CCCACTACCG 600 

TAAATGTCCA TTTATATCAT TTTTTATATA TTCTTATAAA AATGTAAAAA AGAAAAACAC 660 

20 CGCTTCTGCC TTTTCACTGT GTTGGAGTTT TCTGGAGTGA GCACTCACGC CCTAAGCGCA 720 

CATTCATGTG GGCATTTCTT GCGAGCCTCG CAGCCTCCGG AAGCTGTCGA CTTCATGACA 780 

AGCATTTTGT GAACTAGGGA AGCTCAGGGG GGTTACTGGC TTCTCTTGAG TCACACTGCT 840 
AGCAAATGGC AGAACCAAAG CTCAAATAAA AATAAAATAA TTTTCATTCA TTCACTC 

25 Seq ID NO i 129 Protein Sequence 
Protein Accession #: NP_47B103.1 

1 11 21 31 41 51 

an I I I I II 

JU MMMGSARVAE LLLLHGAEPN CADPATLTRP VHDAAREGFL DTLWLHRAG ARLDVRDAWG 60 
RLPVDLAEEL GHRDVARYLR AAAGGTRGSN HARIDAAEGP SDIPD 

Seq ID NO: 130 DNA sequence 
Nucleic Acid Accession #: NM_058197.1 
35 Coding sequence: 272.. 646 

1 11 21 31 41 51 

I I I I I I 

CCCAACCTGG GGCGACTTCA GGTGTGCCAC ATTCGCTAAG TGCTCGGAGT TAATAGCACC 60 

40 TCCTCCGAGC ACTCGCTCAC GGCGTCCCCT TGCCTGGAAA GATACCGCGG TCCCTCCAGA 120 

GGATTTGAGG GACAGGGTCG GAGGGGGCTC TTCCGCCAGC ACCGGAGGAA GAAAGAGGAG 180 

GGGCTGGCTG GTCACCAGAG GGTGGGGCGG ACCGCGTGCG CTCGGCGGCT GOGGAGAGGG 240 

GGAGAGCAGG CAGCGGGCGG CGGGGAGCAG CATGGAGCCG GCGGCGGGGA GCAGCATGGA 300 

GCCGGCGGCG GGGAGCAGCA TGGAGCCTTC GGCTGACTGG CTGGCCACGG CCGCGGCCCG 360 

45 GGGTCGGGTA GAGGAGGTGC GGGCGCTGCT GGAGGCGGGG GCGCTGCCCA ACGCACCGAA 420 

TAGTTACGGT CGGAGGCCGA TCCAGGTGGG TAGAAGGTCT GCAGCGGGAG CAGGGGATGG 480 

CGGGCGACTC TGGAGGACGA AGTTTGCAGG GGAATTGGAA TCAGGTAGCG CTTCGATTCT 540 

CCGGAAAAAG GGGAGGCTTC CTGGGGAGTT TTCAGAAGGG GTTTGTAATC ACAGACCTCC 600 

TCCTGGCGAC GCCCTGGGGG CTTGGGAAAC CAAGGAAGAG GAATGAGGAG CCACGCGCGT 660 

50 ACAGATCTCT CGAATGCTGA GAAGATCTGA AGGGGGGAAC ATATTTGTAT TAGATGGAAG 720 

TCATGATGAT GGGCAGCGCC CGAGTGGCGG AGCTGCTGCT GCTCCACGGC GCGGAGCCCA 780 

ACTGCGCCGA CCCCGCCACT CTCACCCGAC CCGTGCACGA CGCTGCCCGG GAGGGCTTCC 840 

TGGACACGCT GGTGGTGCTG CACCGGGCCG GGGCGCGGCT GGACGTGCGC GATGCCTGGG 900 

GCCGTCTGCC CGTGGACCTG GCTGAGGAGC TGGGCCATCG CGATGTCGCA CGGTACCTGC 960 

55 GCGCGGCTGC GGGGGGCACC AGAGGCAGTA ACCATGCCCG CATAGATGCC GCGGAAGGTC 1020 

CCTCAGACAT CCCCGATTGA AAGAACCAGA GAGGCTCTGA GAAACCTCGG GAACTTAGAT 1080 

CATCAGTCAC CGAAGGTCCT ACAGGGCCAC AACTGCCCCC GCCACAACCC ACCCCGCTTT 1140 

CGTAGTTTTC ATTTAGAAAA TAGAGCTTTT AAAAATGTCC TGCCTTTTAA CGTAGATATA 1200 

TGCCTTCCCC CACTACCGTA AATGTCCATT TATATCATTT TTTATATATT CTTATAAAAA 1260 

60 TGTAAAAAAG AAAAACACOG C' rAVi t J C VlT TTCACTGTGT TGGAGTTTTC TGGAGTGAGC 1320 

ACTCACGCCC TAAGCGCACA TTCATGTGGG CATTTCTTGC GAGCCTCGCA GCCTCCGGAA 1380 

GCTGTCGACT TCATGACAAG CATTTTGTGA ACTAGGGAAG CTCAGGGGGG TTACTGGCTT 1440 

CTCTTGAGTC ACACTGCTAG CAAATGGCAG AACCAAAGCT CAAATAAAAA TAAAATAATT 1500 
TTCATTCATT CACTC 



65 



Seq ID NO: 131 Protein Sequence 
Protein Accession #: NP_478104.1 



1 11 21 31 41 51 

70 i i i I i I 

MEPAAGSSME PAAGSSMEPS ADWLATAAAR GRVEEVRALL BAGALPNAPN SYGRRPIQVG 60 
RRSAAGAGDG GRLWRTKFAG ELESGSASIL RKKGRLPGEF SEGVCNHRPP PGDALGAWET 120 
KEEE 

75 seq ID NO: 132 DNA sequence 

Nucleic Acid Accession #: NM_058195.1 
Coding sequence: 163.. 684 
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WO 03/003906 



PCT/US02/21338 



1 11 21 31 41 51 

I I I I I I 

CCTCCCTACG GGCGCCTCCG GCAGCCCTTC CCGCGTGCGC AGGGCTCAGA GCCGTTCCGA 60 

GATCTTGGAG GTCCGGGTGG GAGTGGGGGT GGGGTGGGGG TGGGGGTGAA GGTGGGGGGC 120 

5 GGGOGCGCTC AGGGAAGGCG GGTGCGCGCC TGCGGGGCGG AGATGGGCAG GGGGCGGTGC 180 

GTGGGTCCCA GTCTGCAGTT AAGGGGGCAG GAGTGGOGCT GCTCACCTCT GGTGCCAAAG 240 

GGCGGCGCAG CGGCTGCCGA GCTCGGCCCT GGAGGCGGCG AGAACATGGT GOGCAGGTTC 300 

TTGGTGACCC TCCGGATTCG GCGCGCGTGC GGCCOGCCGC GAGTGAGGGT TTTCGTGGTT 360 

CACATCCCGC GGCTCACGGG GGAGTGGGCA GCGCCAGGGG CGCCCGCCGC TGTGGCCCTC 420 

10 GTGCTGATGC TACTGAGGAG CCAGCGTCTA GGGCAGCAGC CGCTTCCTAG AAGACCAGGT 480 

CATGATGATG GGCAGCGCCC GAGTGGCGGA GCTGCTGCTG CTCCACGGCG CGGAGCCCAA 540 

CTGCGCCGAC CCCGCCACTC TCACCCGACC CGTGCACGAC GCTGCCCGGG AGGGCTTCCT 600 

GGACACGCTG GTGGTGCTGC ACCGGGCCGG GGCGCGGCTG GACGTGCGCG ATGCCTGGGG 660 

CCGTCTGCCC GTGGACCTGG CTGAGGAGCT GGGCCATCGC GATGTCGCAC GGTACCTGCG 720 

15 CGCGGCTGCG GGGGGCACCA GAGGCAGTAA CCATGCCCGC ATAGATGCCG CGGAAGGTCC 780 

CTCAGACATC CCCGATTGAA AGAACCAGAG AGGCTCTGAG AAACCTCGGG AAACTTAGAT 840 

CATCAGTCAC CGAAGGTCCT ACAGGGCCAC AACTGCCCCC GCCACAACCC ACCCCGCTTT 900 

CGTAGTTTTC ATTTAGAAAA TAGAGCTTTT AAAAATGTCC TGCCTTTTAA CGTAGATATA 960 

TGCCTTCCCC CACTACCGTA AATGTCCATT TATATCATTT TTTATATATT CTTATAAAAA 1020 

20 TGTAAAAAAG AAAAACACCG CTTCTGCCTT TTCACTGTGT TGGAGTTTTC TGGAGTGAGC 1080 

ACTCACGCCC TAAGCGCACA TTCATGTGGG CATTTCTTGC GAGCCTCGCA GCCTCCGGAA 1140 

GCTGTCGACT TCATGACAAG CATTTTGTGA ACTAGGGAAG CTCAGGGGGG TTACTGGCTT 1200 

CTCTTGAGTC ACACTGCTAG CAAATGGCAG AACCAAAGCT CAAATAAAAA TAAAATAATT 1260 
TTCATTCATT CACTC 



25 



Seq ID NO: 133 Protein Sequence 
Protein Accession H r NP 478102.1 



1 11 21 31 41 51 

30 | | | | || 

MGRGRCVGPS LQLRGQEWRC SPLVPKGGAA AAELGPGGGE NMVRRFLVTL RIRRACGPPR 60 

VRVFWHIPR LTGEWAAPGA PAAVALVLML LRSQRLGQQP LPRRPGHDDG QRPSGGAAAA 120 

PRRGAQLRRP RHSHPTRARR CPGGLPGHAG GAAPGRGAAG RARCLGPSAR GPG 

35 Seq ID NO: 134 DNA sequence 

Nucleic Acid Accession #: NM_023915 
Coding sequence: 1..1077 

1 11 21 31 41 51 

40 | ill II 

ATGGGGTTCA ACTTGACGCT TGCAAAATTA CCAAATAACG AGCTGCACGG CCAAGAGAGT 60 

CACAATTCAG GCAACAGGAG CGACGGGCCA GGAAAGAACA CCACCCTTCA CAATGAATTT 120 

GACACAATTG TCTTGCCGGT GCTTTATCTC ATT AT ATT TG TGGCAAGCAT CTTGCTGAAT 180 

GGTTTAGCAG TGTGGATCTT CTTCCACATT AGGAATAAAA CCAGCTTCAT ATTCTATCTC 240 

45 AAAAACATAG TGGTTGCAGA CCTCATAATG ACGCTGACAT TTCCATTTCG AATAGTCCAT 300 

GATGCAGGAT TTGGACCTTG GTACTTCAAG TTTATTCTCT GCAGATACAC TTCAGTTTTG 360 

TTTTATGCAA ACATGTATAC TTCCATCGTG TTCCTTGGGC TGATAAGCAT TGATCGCTAT 420 

CTGAAGGTGG TCAAGCCATT TGGGGACTCT CGGATGTACA GCATAACCTT CACGAAGGTT 480 

TTATCTGTTT GTGTTTGGGT GATCATGGCT GTTTTGTCTT TGCCAAACAT CATCCTAACA 540 

50 AATGGTCAGC CAACAGAGGA CAATATCCAT GACTGCTCAA AACTTAAAAG TCCTTTGGGG 600 

GTCAAATGGC ATACGGCAGT CACCTATGTG AACAGCTGCT TGTTTGTGGC CGTGCTGGTG 660 

ATTCTGATCG GATGTTACAT AGCCATATCC AGGTACATCC ACAAATCCAG CAGGCAATTC 720 

ATAAGTCAGT CAAGCCGAAA GCGAAAACAT AACCAGAGCA TCAGGGTTGT TGTGGCTGTG 780 

TTTTTTACCT GCTTTCTACC ATATCACTTG TGCAGAATTC CTTTTACTTT TAGTCACTTA 840 

55 GACAGGCTTT TAGATGAATC TGCACAAAAA ATCCTATATT ACTGCAAAGA AATTACACTT 900 

TTCTTGTCTG CGTGTAATGT TTGCCTGGAT CCAATAATTT ACTTTTTCAT GTGTAGGTCA 960 

TTTTCAAGAA GGCTGTTCAA AAAATCAAAT ATCAGAACCA GGAGTGAAAG CATCAGATCA 1020 
CTGCAAAGTG TGAGAAGATC GGAAGTTCGC ATATATTATG ATTACACTGA TGTGTAA 

60 Seq ID NO: 135 Protein Sequence 
Protein Accession ft: NP_076404 

1 11 21 31 41 51 

,.111111 

OJ MGFNLTLAKL PNNELHGQES HNSGNRSDGP GKNTTLHNEF DTIVLPVLYL IIFVASILLN 60 

GLAVWIFFHI RNKTSFIFYL KNIWADLIM TLTFPFRIVH DAGFGPWYFK FILCRYTSVL 120 

FYANMYTSIV FLGLISIDRY LKWKPFGDS RMYSITFTKV LSVCVWVIMA VLSLPNIILT 180 

NGQPTEDNIH DCSKLKSPLG VKWHTAVTYV NSCLFVAVLV ILIGCYIAIS RYIHKSSRQF 240 

ISQSSRKRKH NQSIRVWAV FFTCFLPYHL CRIPFTFSHL DRLLDESAQK ILYYCKEITL 300 
70 FLSACNVCLD PIIYFFMCRS FSRRLFKKSN IRTRSESIRS LQSVRRSEVR IYYDYTDV 

Seq ID NO: 136 DNA sequence 
Nucleic Acid Accession ft: NM_005329.1 
75 Coding sequence: 1..1662 

1 11 21 31 41 51 

I I I I I I 
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ATGCCGGTGC AGCTGACGAC AGCCCTGCGT GTGGTGGGCA CCAGCCTGTT TGCCCTGGCA 60 

GTGCTGGGTG GCATCCTGGC AGCCTATGTG ACGGGCTACC AGTTCATCCA CACGGAAAAG X20 

CACTACCTGT CCTTCGGCCT GTACGGCGCC ATCCTGGGCC TGCACCTGCT CATTCAGAGC 180 

CTTTTTGCCT TCCTGGAGCA CCGGCGCATG CGACGTGCCG GCCAGGCCCT GAAGCTGCCC 240 

5 TCCCCGCGGC GGGGCTCGGT GGCACTGTGC ATTGCCGCGT ACCAGGAGGA CCCTGACTAC 300 

TTGCGCAAGT GCCTGCGCTC GGCCCAGCGC ATCTCCTTCC CTGACCTCAA GGTGGTCATG 360 

GTGGTGGATG GCAACCGCCA GGAGGACGCC TACATGCTGG ACATCTTCCA CGAGGTGCTG 420 

GGCGGCACCG AGCAGGCCGG CTTCTTTGTG TGGCGCAGCA ACTTCCATGA GGCAGGCGAG 480 

GGTGAGACGG AGGCCAGCCT GCAGGAGGGC ATGGACCGTG TGCGGGATGT GGTGCGGGCC S40 

10 AGCACCTTCT CGTGCATCAT GCAGAAGTGG GGAGGCAAGC GCGAGGTCAT GTACACGGCC 600 

TTCAAGGCCC TCGGCGATTC GGTGGACTAC ATCCAGGTGT GCGACTCTGA CACTGTGCTG 660 

GATCCAGCCT GCACCATCGA GATGCTTCGA GTCCTGGAGG AGGATCCCCA AGTAGGGGGA 720 

GTCGGGGGAG ATGTCCAGAT CCTCAACAAG TACGACTCAT GGATTTCCTT CCTGAGCAGC 780 

GTGCGGTACT GGATGGCCTT CAACGTGGAG CGGGCCTGCC AGTCCTACTT TGGCTGTGTG 840 

15 CAGTGTATTA GTGGGCCCTT GGGCATGTAC CGCAACAGCC TCCTCCAGCA GTTCCTGGAG 900 

GACTGGTACC ATCAGAAGTT CCTAGGCAGC AAGTGCAGCT TCGGGGATGA CCGGCACCTC 960 

ACCAACCGAG TCCTGAGCCT TGGCTACCGA ACTAAGTATA CCGCGCGCTC CAAGTGCCTC 1020 

ACAGAGACCC CCACTAAGTA CCTCCGGTGG CTCAACCAGC AAACCCGCTG GAGCAAGTCT 1080 

TACTTCCGGG AGTGGCTCTA CAACTCTCTG TGGTTCCATA AGCACCACCT CTGGATGACC 1140 

20 TACGAGTCAG TGGTCACGGG TTTCTTCCCC TTCTTCCTCA TTGCCACGGT TATACAGCTT 1200 

TTCTACCGGG GCCGCATCTG GAACATTCTC CTCTTCCTGC TGACGGTGCA GCTGGTGGGC 1260 

ATTATCAAGG CCACCTACGC CTGCTTCCTT CGGGGCAATG CAGAGATGAT CTTCATGTCC 1320 

CTCTACTCCC TCCTCTATAT GTCCAGCCTT CTGCCGGCCA AGATCTTTGC CATTGCTACC 13 80 

ATCAACAAAT CTGGCTGGGG CACCTCTGGC CGAAAAACCA TTGTGGTGAA CTTCATTGGC 1440 

25 CTCATTCCTG TGTCCATCTG GGTGGCAGTT CTCCTGGGAG GGCTGGCCTA CACAGCTTAT 1500 

TGCCAGGACC TGTTCAGTGA GACAGAGCTA GCCTTCCTTG TCTCTGGGGC TATACTGTAT 1560 

GGCTGCTACT GGGTGGCCCT CCTCATGCTA TATCTGGCCA TCATCGCCCG GCGATGTGGG 1620 
AAGAAGCCGG AGCAGTACAG CTTGGCTTTT GCTGAGGTGT GA 

30 Seq ID NO: 137 Protein Sequence 
Protein Accession 8: NP_005320.1 

1 11 21 31 41 51 

„ I I 1 ! I I 

3 J MPVQLTTALR WGTSLPAJLA VLGGILAAYV TGYQPIHTEK HYLSFGLYGA ILGLHLLIQS 60 

LFAFLEHRRM RRAGQALKLP SPRRGSVALC IAAYQEDPDY LRKCLRSAQR ISFPDLKWM 120 

WDGNRQBDA YMLDIFHEVL GGTEQAGFFV WRSNFHEAGE GETEASLQEG MDRVRDWRA 180 

STFSCIMQKW GGKREVMYTA PKALGDSVDY IQVCDSDTVL DPACTIEMLR VLEEDPQVGG 240 

VGGDVQILNK YDSWISFLSS VRYWMAFNVE RACQSYFGCV QCISGPLGMY RNSLLQQFLE 300 

40 DWYHQKFLGS KCSFGDDRHL TNRVLSLGYR TKYTARSKCL TETPTKYLRW LNQQTRWSKS 360 

YFREWLYNSL WFHKHHLWMT YESWTGFFP FFLIATVIQL FYRGRIWNIL LFLLTVQLVG 420 

IIKATYACFL RGNAEMIPMS LYSLLYMSSI* LPAKI FAIAT INKSGWGTSG RKTIWNFIG 480 

LIPVSIWVAV LLGGLAYTAY CQDLFSETEL AFLVSGAILY GCYWVALLML YLAI IARRCG 540 
KKPEQYSLAF AEV 



45 
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70 
75 



Seq ID NO: 138 DNA Sequence 

Nucleic Acid Accession #: NM_001327.1 

Coding sequence: 89.. 631 



1 11 21 31 41 51 

I I I ! I I 

AGCAGGGGGC GCTGTGTGTA CCGAGAATAC GAGAATACCT CGTGGGCCCT GACCTTCTCT 60 

CTGAGAGCCG GGCAGAGGCT CCGGAGCCAT GCAGGCCGAA GGCCGGGGCA CAGGGGGTTC 120 

55 GACGGGCGAT GCTGATGGCC CAGGAGGCCC TGGCATTCCT GATGGCCCAG GGGGCAATGC 180 

TGGCGGCCCA GGAGAGGCGG GTGCCACGGG CGGCAGAGGT CCCCGGGGCG CAGGGGCAGC 240 

AAGGGCCTCG GGGCCGGGAG GAGGCGCCCC GCGGGGTCCG CATGGCGGCG CGGCTTCAGG 300 

GCTGAATGGA TGCTGCAGAT GCGGGGCCAG GGGGCCGGAG AGCCGCCTGC TTGAGTTCTA 360 

CCTCGCCATG CCTTTCGCGA CACCCATGGA AGCAGAGCTG GCCCGCAGGA GCCTGGCCCA 420 

60 GGATGCCCCA CCGCTTCCCG TGCCAGGGGT GCTTCTGAAG GAGTTCACTG TGTCCGGCAA 480 

CATACTGACT ATCCGACTGA CTGCTGCAGA CCACCGCCAA CTGCAGCTCT CCATCAGCTC 540 

CTGTCTCCAG CAGCTTTCCC TGTTGATGTG GATCACGCAG TGCTTTCTGC CCGTGTTTTT 600 

GGCTCAGCCT CCCTCAGGGC AGAGGCGCTA AGCCCAGCCT GGCGCCCCTT CCTAGGTCAT 660 

GCCTCCTCCC CTAGGGAATG GTCCCAGCAC GAGTGGCCAG TTCATTGTGG GGGCCTGATT 720 

65 GTTTGTCGCT GGAGGAGGAC GGCTTACATG TTTGTTTCTG TAGAAAATAA AACTGAGCTA 780 



Seq ID NO: 13 9 Protein Sequence 
Protein Accession # : NP_001318.1 

1 11 21 31 41 51 

I I I I I I 

MQAEGRGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPGGGA 60 

PRGPHGGAAS GLNGCCRCGA RGPESRLLEF YLAMPFATPM EAELARRSLA QDAPPLPVPG 120 

VLLKEFTVSG NILTIRLTAA DHRQLQLSIS SCLQQLSLLM WITQCFLPVF LAQPPSGQRR 180 



Seq ID NO: 140 DNA Sequence 

Nucleic Acid Accession #: Eos sequence 
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Coding sequence: 53.. 459 



1 11 21 

I I i 

CCTCGTGGGC CCTGACCTTC TCTCTGAGAG 
GAAGGCCAGG GCACAGGGGG TTCGACGGGC 
CCTGATGGCC CAGGGGGCAA TGCTGGCGGC 
GGTCCCCGGG GCGCAGGGGC AGCAAGGGCC 
CCGCATGGCG GTGCCGCTTC TGCGCAGGAT 
GACAGCCGCC TGCTTCAGTT CCGACTGACT 
ATCAGCTCCT GTCTCCAGCA GCTTTCCCTG 
GTGTTTTTGG CTCAGGCTCC CTCAGGGCAG 
TAGGTCATGC CTCCTCCCCT AGGGAATGGT 
GCCTGATTGT TTGTCGCTGG AGGAGGACGG 
CTGAGCTA 



31 41 51 

! I I 

CCGGGCAGAG GCTCCGGAGC CATGCAGGCC 
GATGCTGATG GCCCAGGAGG CCCTGGCATT 
CCAGGAGAGG CGGGTGCCAC GGGCGGCAGA 
TCGGGGCCGA GAGGAGGCGC CCCGCGGGGT 
GGAAGGTGCC CCTGCGGGGC CAGGAGGCCG 
GCTGCAGACC ACCGCCAACT GCAGCTCTCC 
TTGATGTGGA TCACGCAGTG CTTTCTGCCC 
AGGCGCTAAG CCCAGCCTGG CGCCCCTTCC 
CCCAGCACGA GTGGCCAGTT CATTGTGGGG 
CTTACATGTT TGTTTCTGTA GAAAATAAAG 



Seq ID NO: 141 Protein Sequence 
Protein Accession #: Bos sequence 



1 11 21 31 41 51 

I I I i I 1 

MQAEGQGTGG STGDADGPGG PGIPDGPGGN AGGPGEAGAT GGRGPRGAGA ARASGPRGGA 
PRGPHGGAAS AQDGRCPCGA RRPDSRIiLQF RLTAADHRQL QLSISSCLQQ LSLLMWITQC 
FLPVFLAQAP SGQRR 



Seq ID NO: 142 DNA Sequence 

Nucleic Acid Accession #: NM__130467.2 

Coding sequence: 2 4 6.. 63 8 

1 11 21 31 41 51 

I 1 I I I I 

AGGCAGAGCT CTGCAAGGAG AGGTTGTGTC TTCGTTCTTT CCGCCATCTT CGTTCTTTCC 
AACATCTTCG TTCTTTCTCA CTGACCGAGA CTCAGCCGGT AGGTCTGCAG AGTGGTCTTC 
CTGGTAATTT AGTTGTGAGT GAATGTGTGG AGGAGCCAGC GGGCTTAGGA CAGGTCCTGT 
GGCACAGTCC GTGGCTTTGA GGGAAAAGGG CCTCGCGGTG GTCCTCCGCC TTCCCCCAGG 
TCGTGATGCA GGCGCCATGG GCCGGTAATC GTGGCTGGGC TGGAACGAGG GAGGAAGTGA 
GAGATATGAG TGAGCATGTA ACAAGATCCC AATCCTCAGA AAGAGGAAAT GACCAAGAGT 
CTTCCCAGCC AGTTGGACCT GTGATTGTCC AGCAGCCCAC TGAGGAAAAA CGTCAAGAAG 
AGGAACCACC AACTGATAAT CAGGGTATTG CACCTAGTGG GGAGATCAAA AATGAAGGAG 
CACCTGCTGT TCAAGGGACT GATGTGGAAG CTTTTCAACA GGAACTGGCT CTGCTTAAGA 
TAGAGGATGC ACCTGGAGAT GGTCCTGATG TCAGGGAGGG GACTCTGCCC ACTTTTGATC 
CCACTAAAGT GCTGGAAGCA GGTGAAGGGC AACTATAGGT TTAAACCAAG ACAAATGAAG 
ACTGAAACCA AGAATATTGT TCTTATGCTG GAAATTTGAC TGCTAACATT CTCTTAATAA 
AGTTTTACAG TTTTC 



Seq ID NOi 143 Protein Sequence 
Protein Accession fh NP_569734 

1 11 21 31 41 51 

I I I i I I 

HSEKVTRSQS SERGNDQESS QPVGPVIVQQ PTEEKRQEEE PPTDNQGIAP SGEIKNEGAP 
AVQGTDVEAF QQELALLKIE DAPGDGPDVR EGTLPTFDPT KVLEAGEGQL 



Seq ID NO: 144 DNA Sequence 

Nucleic Acid Accession #: NM_001476.1 

Coding sequence: 82.. 43 5 

1 11 21 31 41 51 

I I ! I 1 I 

GCCAGGGAGC TGTGAGGCAG TGCTGTGTGG TTCCTGCCGT CCGGACTCTT TTTCCTCTAC 
TGAGATTCAT CTGTGTGAAA TATGAGTTGG CGAGGAAGAT CGACCTATTA TTGGCCTAGA 
CCAAGGCGCT ATGTACAGCC TCCTGAAGTG ATTGGGCCTA TGCGGCCCGA GCAGTTCAGT 
GATGAAGTGG AACCAGCAAC ACCTGAAGAA GGGGAACCAG CAACTCAACG TCAGGATCCT 
GCAGCTGCTC AGGAGGGAGA GGATGAGGGA GCATCTGCAG GTCAAGGGCC GAAGCCTGAA 
GCTGATAGCC AGGAACAGGG TCACCCACAG ACTGGGTGTG AGTGTGAAGA TGGTCCTGAT 
GGGCAGGAGG TGGACCCGCC AAATCCAGAG GAGGTGAAAA CGCCTGAAGA AGGTGAAAAG 
CAATCACAGT GTTAAAAGAA GACACGTTGA AATGATGCAG GCTGCTCCTA TGTTGGAAAT 
TTGTTCATTA AAATTCTCCC AATAAAGCTT TACAGCCTTC TGCAAAA 



Seq ID NO: 145 Protein Sequence 
Protein Accession #: NP_001467.1 



X 11 21 31 41 51 

I I i I I I 

MSWRGRSTYY WPRPRRWQP PEVIGPMRPE QFSDEVEPAT PEEGEPATQR QDPAAAQEGE 
DEGASAGOGP KPEADSQEQG HPQTGCECED GPDGQEVDPP NPEEVKTPEE GEXQSQC 
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WO 03/003906 
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Seq ID NOi 146 DNA Sequence 
Nucleic Acid Accession 8: NM_00S562 
5 Coding sequence: 90.. 3671 

1 11 21 31 41 51 

I I I I I I 

ACAGCGGAGC GCAGAGTGAG AACCACCAAC CGAGGCGCCG GGCAGCGACC CCTGCAGCGG 60 

10 AGACAGAGAC TGAGCGGCCC GGCACCGCCA TGCCTGCGCT CTGGCTGGGC TGCTGCCTCT 120 

GCTTCTCGCT CCTCCTGCCC GCAGCCCGGG CCACCTCCAG GAGGGAAGTC TGTGATTGCA 180 

ATGGGAAGTC CAGGCAGTGT ATCTTTGATC GGGAACTTCA CAGACAAACT GGTAATGGAT 240 

TCCGCTGCCT CAACTGCAAT GACAACACTG ATGGCATTCA CTGCGAGAAG TGCAAGAATG 300 

GCTTTTACCG GCACAGAGAA AGGGACOGCT GTTTGCCCTG CAATTGTAAC TCCAAAGGTT 360 

15 CTCTTAGTGC TCGATGTGAC AACTCTGGAC GGTGCAGCTG TAAACCAGGT GTGACAGGAG 420 

CCAGATGCGA CCGATGTCTG CCAGGCTTCC ACATGCTCAC GGATGCGGGG TGCACCCAAG 480 

ACCAGAGACT GCTAGACTCC AAGTGTGACT GTGACCCAGC TGGCATCGCA GGGCCCTGTG 540 

ACGCGGGCCG CTGTGTCTGC AAGCCAGCTG TTACTGGAGA ACGCTGTGAT AGGTGTCGAT 600 

CAGGTTACTA TAATCTGGAT GGGGGGAACC CTGAGGGCTG TACCCAGTGT TTCTGCTATG 660 

20 GGCATTCAGC CAGCTGCCGC AGCTCTGCAG AATACAGTGT CCATAAGATC ACCTCTACCT 720 

TTCATCAAGA TGTTGATGGC TGGAAGGCTG TCCAACGAAA TGGGTCTCCT GCAAAGCTCC 780 

AATGGTCACA GCGCCATCAA GATGTGTTTA GCTCAGCCCA ACGACTAGAC CCTGTCTATT 840 

TTGTGGCTCC TGCCAAATTT CTTGGGAATC AACAGGTGAG CTATGGGCAA AGCCTGTCCT 900 

TTGACTACCG TGTGGACAGA GGAGGCAGAC ACCCATCTGC CCATGATGTG ATTCTGGAAG 960 

25 GTGCTGGTCT ACGGATCACA GCTCCCTTGA TGCCACTTGG CAAGACACTG CCTTGTGGGC 1020 

TCACCAAGAC TTACACATTC AGGTTAAATG AGCATCCAAG CAATAATTGG AGCCCCCAGC 1080 

TGAGTTACTT TGAGTATCGA AGGTTACTGC GGAATCTCAC AGCCCTCCGC ATCCGAGCTA 1140 

CATATGGAGA ATACAGTACT GGGTACATTG ACAATGTGAC CCTGATTTCA GCCCGCCCTG 1200 

TCTCTGGAGC CCCAGCACCC TGGGTTGAAC AGTGTATATG TCCTGTTGGG TACAAGGGGC 1260 

30 AATTCTGCCA GGATTGTGCT TCTGGCTACA AGAGAGATTC AGCGAGACTG GGGCCTTTTG 1320 

GCACCTGTAT TCCTTGTAAC TGTCAAGGGG GAGGGGCCTG TGATCCAGAC ACAGGAGATT 1380 

GTTATTCAGG GGATGAGAAT CCTGACATTG AGTGTGCTGA CTGCCCAATT GGTTTCTACA 1440 

ACGATCCGCA CGACCCCCGC AGCTGCAAGC CATGTCCCTG TCATAACGGG TTCAGCTGCT 1500 

CAGTGATGCC GGAGACGGAG GAGGTGGTGT GCAATAACTG CCCTCCCGGG GTCACCGGTG 1560 

35 CCCGCTGTGA GCTCTGTGCT GATGGCTACT TTQGGGACCC CTTTGGTGAA CATGGCCCAG 1620 

TGAGGCCTTG TCAGCCCTGT CAATGCAACA ACAATGTGGA CCCCAGTGCC TCTGGGAATT 1680 

GTGACCGGCT GACAGGCAGG TGTTTGAAGT GTATCCACAA CACAGCCGGC ATCTACTGCG 1740 

ACCAGTGCAA AGCAGGCTAC TTCGGGGACC CATTGGCTCC CAACCCAGCA GACAAGTGTC 1800 

GAGCTTGCAA CTGTAACCCC ATGGGCTCAG AGCCTGTAGG ATGTCGAAGT GATGGCACCT 1860 

40 GTGTTTGCAA GCCAGGATTT GGTGGCCCCA ACTGTGAGCA TGGAGCATTC AGCTGTCCAG 1920 

CTTGCTATAA TCAAGTGAAG ATTCAGATGG ATCAGTTTAT GCAGCAGCTT CAGAGAATGG 1980 

AGGCCCTGAT TTCAAAGGCT CAGGGTGGTG ATGGAGTAGT ACCTGATACA GAGCTGGAAG 2040 

GCAGGATGCA GCAGGCTGAG CAGGCCCTTC AGGACATTCT GAGAGATGCC CAGATTTCAG 2100 

AAGGTGCTAG CAGATCCCTT GGTCTCCAGT TGGCCAAGGT GAGGAGCCAA GAGAACAGCT 2160 

45 ACCAGAGCCG CCTGGATGAC CTCAAGATGA CTGTGGAAAG AGTTCGGGCT CTGGGAAGTC 2220 

AGTACCAGAA CCGAGTTCGG GATACTCACA GGCTCATCAC TCAGATGCAG CTGAGCCTGG 2280 

CAGAAAGTGA AGCTTCCTTG GGAAACACTA ACATTCCTGC CTCAGACCAC TACGTGGGGC 2340 

CAAATGGCTT TAAAAGTCTG GCTCAGGAGG CCACAAGATT AGCAGAAAGC CACGTTGAGT 2400 

t CAGCCAGTAA CATGGAGCAA CTGACAAGGG AAACTGAGGA CTATTCCAAA CAAGCCCTCT 2460 

50 CACTGGTGCG CAAGGCCCTG CATGAAGGAG TCGGAAGCGG AAGCGGTAGC CCGGACGGTG 2520 

CTGTGGTGCA AGGGCTTGTG GAAAAATTGG AGAAAACCAA GTCCCTGGCC CAGCAGTTGA 2580 

CAAGGGAGGC CACTCAAGCG GAAATTGAAG CAGATAGGTC TTATCAGCAC AGTCTCCGCC 2640 

TCCTGGATTC AGTGTCTCGG CTTCAGGGAG TCAGTGATCA GTCCTTTCAG GTGGAAGAAG 2700 

CAAAGAGGAT CAAACAAAAA GCGGATTCAC TCTCAACGCT GGTAACCAGG CATATGGATG 2760 

55 AGTTCAAGCG TACACAAAAG AATCTGGGAA ACTGGAAAGA AGAAGCACAG CAGCTCTTAC 2820 

AGAATGGAAA AAGTGGGAGA GAGAAATCAG ATCAGCTGCT TTCCCGTGCC AATCTTGCTA 2880 

AAAGCAGAGC ACAAGAAGCA CTGAGTATGG GCAATGCCAC TTTTTATGAA GTTGAGAGCA 2940 

TCCTTAAAAA CCTCAGAGAG TTTGACCTGC AGGTGGACAA CAGAAAAGCA GAAGCTGAAG 3000 

^ AAGCCATGAA GAGACTCTCC TACATCAGCC AGAAGGTTTC AGATGCCAGT GACAAGACCC 3060 

60 AGCAAGCAGA AAGAGCCCTG GGGAGCGCTG CTGCTGATGC ACAGAGGGCA AAGAATGGGG 3120 

CCGGGGAGGC CCTGGAAATC TCCAGTGAGA TTGAACAGGA GATTGGGAGT CTGAACTTGG 3180 

AAGCCAATGT GACAGCAGAT GGAGCCTTGG CCATGGAAAA GGGACTGGCC TCTCTGAAGA 3240 

GTGAGATGAG GGAAGTGGAA GGAGAGCTGG AAAGGAAGGA GCTGGAGTTT GACACGAATA 3300 

TGGATGCAGT ACAGATGGTG ATTACAGAAG CCCAGAAGGT TGATACCAGA GCCAAGAACG 3360 

65 CTGGGGTTAC AATCCAAGAC ACACTCAACA CATTAGACGG CCTCCTGCAT CTGATGGACC 3420 

AGCCTCTCAG TGTAGATGAA GAGGGGCTGG TCTTACTGGA GCAGAAGCTT TCCCGAGCCA 3480 

AGACCCAGAT CAACAGCCAA CTGGGGCCCA TGATGTCAGA GCTGGAAGAG AGGGCACGTC 3540 

AGCAGAGGGG CCACCTCCAT TTGCTGGAGA CAAGCATAGA TGGGATTCTG GCTGATGTGA 3600 

AGAACTTGGA GAACATTAGG GACAACCTGC CCCCAGGCTG CTACAATACC CAGGCTCTTG 3660 

70 AGCAACAGTG AAGCTGCCAT AAATATTTCT CAACTGAGGT TCTTGGGATA CAGATCTCAG 3720 

GGCTCGGGAG CCATGTCATG TGAGTGGGTG GGATGGGGAC ATTTGAACAT GTTTAATGGG 3780 

TATGCTCAGG TCAACTGACC TGACCCCATT CCTGATCCCA TGGCCAGGTG GTTGTCTTAT 3840 

TGCACCATAC TCCTTGCTTC CTGATGCTGG GCAATGAGGC AGATAGCACT GGGTGTGAGA 3900 

_ _ ATGATCAAGG ATCTGGACCC CAAAGAATAG ACTGGATGGA AAGACAAACT GCACAGGCAG 3960 

75 ATGTTTGCCT CATAATAGTC GTAAGTGGAG TCCTGGAATT TGGACAAGTG CTGTTGGGAT 4020 

ATAGTCAACT TATTCTTTGA GTAATGTGAC TAAAGGAAAA AACTTTGACT TTGCCCAGGC 4080 

ATGAAATTCT TCCTAATGTC AGAACAGAGT GCAACCCAGT CACACTGTGG CCAGTAAAAT 4140 

ACTATTGCCT CATATTGTCC TCTGCAAGCT TCTTGCTGAT CAGAGTTCCT CCTACTTACA 4200 
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ACCCAGGGTG TGAACATGTT CTCCATTTTC AAGCTGGAAG AAGTGAGCAG TGTTGGAGTG 4260 

AGGACCTGTA AGGCAGGCCC ATTCAGAGCT ATGGTGCTTG CTGGTGCCTG CCACCTTCAA 4320 

GTTCTGGACC TGGGCATGAC ATCCTTTCTT TTAATGATGC CATGGCAACT TAGAGATTGC 4380 

ATTTTTATTA AAGCATTTCC TACCAGCAAA GCAAATGTTG GGAAAGTATT TACTTTTTCG 4440 

5 GTTTCAAAGT GATAGAAAAG TGTGGCTTGG GCATTGAAAG AGGTAAAATT CTCTAGATTT 4500 

ATTAGTCCTA ATTCAATCCT ACTTTTCGAA CACCAAAAAT GATGCGCATC AATGTATTTT 4560 

ATCTTATTTT CTCAATCTCC TCTCTCTTTC CTCCACCCAT AATAAGAGAA TGTTCCTACT 4620 

CACACTTCAG CTGGGTCACA TCCATCCCTC CATTCATCCT TCCATCCATC TTTCCATCCA 4680 

TTACCTCCAT CCATCCTTCC AACATATATT TATTGAGTAC CTACTGTGTG CCAGGGGCTG 4740 

10 GTGGGACAGT GGTGACATAG TCTCTGCCCT CATAGAGTTG ATTGTCTAGT GAGGAAGACA 4 BOO 

AGCATTTTTA AAAAATAAAT TTAAACTTAC AAACTTTGTT TGTCACAAGT GGTGTTTATT 4860 

GCAATAACCG CTTGGTTTGC AACCTCTTTG CTCAACAGAA CATATGTTGC AAGACCCTCC 4920 

CATGGGGGCA CTTGAGTTTT GGCAAGGCTG ACAGAGCTCT GGGTTGTGCA CATTTCTTTG 4980 

CATTCCAGCT GTCACTCTGT GCCTTTCTAC AACTGATTGC AACAGACTGT TGAGTTATGA 5040 

15 TAACACCAGT GGGAATTGCT GGAGGAACCA GAGGCACTTC CACCTTGGCT GGGAAGACTA 5100 

TGGTGCTGCC TTGCTTCTGT ATTTCCTTGG ATTTTCCTGA AAGTGTTTTT AAATAAAGAA 5160 
CAATTGTTAG ATGCC 

Seq ID NO: 147 Protein Sequence 
20 Protein Accession g: NPJJ05553 

1 11 21 31 41 51 

I I I I I I 

MPALWU3CCL CFSKLLPAAR ATSRREVCDC NGKSRQCIFD REIiHRQTGNG FRCLNCNDNT 60 

25 DGIHCEKCKN GFYRHRERDR CLPCNCNSKG SLSARCDNSG RCSCKPGVTG ARCDRCLPGF 120 

HMLTDAGCTQ DQRLLDSKCD CDPAGIAGPC DAGRCVCKPA VTGERCDRCR SGYYNLDGGN 180 

PEGCTQCFCY GHSASCRSSA BYSVHKITST FHQDVDGWKA VQRNGSPAKL QWSQRHQDVF 240 

SSAQRLDPVY FVAPAKFLGN QQVSYGQSLS FDYRVDRGGR HPSAHOVILB GAGLRITAPL 300 

MPLGKTLPCG LTKTYTFRLN EHPSNNWSPQ LSYFEYRRLL RNLTALRIRA TYGEYSTGYI 360 

30 DNVTLISARP VSGAPAPWVE QCICPVGYKG QFCQDCASGY KRDSARLGPF GTCIPCNCQG 420 

GGACDPDTGO CYSGDENPDI ECADCPIGFY NDPHDPRSCK PCPCHNGFSC SVMPETEEW 480 

CNNCPPGVTG ARCELCADGY FGDPFGEHGP VRPCQPCQCN NNVDPSASGtf CDRLTGRCLK 540 

CIHNTAGIYC DQCKAGYFGD PLAPNPADKC RACNCNPMGS EPVGCRSDGT CVCKPGFGGP 600 

NCEHGAFSCP ACYNQVKIQM DQFMQQLQRM EALISKAQGG DGWPDTELE GRMQQAEQAL 660 

35 QDILRDAQIS EGASRSLGLQ LAKVRSQENS YQSRLDDLKM TVERVRALGS QYQNRVRDTH 720 

RLITOMQLSL AESEASLGNT NIPASDHYVG PNGFKSLAQE ATRIiAESHVB SASNMEQLTR 780 

ETEDYSKQAL SLVRKALHEG VGSGSGSPDG AWQGLVEKL EKTKSLAQQI* TREATQAEIE 840 

ADRSYQHSLR LLDSVSRLQG VSDQSFQVEE AKRIKQKADS LSTLVTRHMD EFKRTQKNLG 900 

NWKEEAQQL.L QNGKSGREKS DQLLSRANLA KSRAQEALSM GNATFYEVES ILKNLREFDL 960 

40 QVDNRKAEAE EAMKRLSYIS QKVSDASDKT QQAERALGSA AADAQRAKNG AGEALBISSE 1020 

IEQEIGSLNL EANVTADGAL AMEKGLASLK SEMREVEGEL ERKELEFDTN MDAVQMVITE 1080 

AQKVDTRAKN AGVTIQDTLN TLDGLLHLMD QPLSVDEEGL VLLEQKLSRA KTQINSQLRP 1140 
MMSELEERAR QQRGHLHLLE TSIDGILADV KNLENIRDNL PPGCYNTQAL EQQ 

45 Seq ID NO: 148 DNA Sequence 

Nucleic Acid Accession »: NM_021048.2 
Coding sequence: 327.. 1436 

1 11 21 31 41 51 

50 | ill I I 

GGCACGAGGG AGAAGCGAGG TTCTCGTTCT GAGGGACAGG CTTGAGATCG GCTGAAGAGA 60 

GCGGGCCCAG GCTCTGTGAG GAGGCAAGGG AGGTGAGAAC CTTGCTCTCA GAGGGTGACT 120 

CAAGTCAACA CAGGGAACCC CTCTTTTCTA CAGACACAGT GGGTCGCAGG ATCTGACAAG 180 

AGTCCAGGTT CTCAGGGGAC AGGGAGAGCA AGAGGTCAAG AGCTGTGGGA CACCACAGAG 240 

55 CAGCACTGAA GGAGAAGACC TGCCTGTGGG TCCCCATCGC CCAAGTCCTG CCCACACTCC 300 

CACCTGCTAC CCTGATCAGA GTCATCATGC CTCGAGCTCC AAAGCGTCAG CGCTGCATGC 360 

CTGAAGAAGA TCTTCAATCC CAAAGTGAGA CACAGGGCCT CGAGGGTGCA CAGGCTCCCC 420 

TGGCTGTGGA GGAGGATGCT TCATCATCCA CTTCCACCAG CTCCTCTTTT CCATCCTCTT 480 

TTCCCTCCTC CTCCTCTTCC TCCTCCTCCT CCTGCTATCC TCTAATACCA AGCACCCCAG 540 

60 AGGAGGTTTC TGCTGATGAT GAGACACCAA ATCCTCCCCA GAGTGCTCAG ATAGCCTGCT 600 

CCTCCCCCTC GGTCGTTGCT TCCCTTCCAT TAGATCAATC TGATGAGGGC TCCAGCAGCC 660 

AAAAGGAGGA GAGTCCAAGC ACCCTACAGG TCCTGCCAGA CAGTGAGTCT TTACCCAGAA 720 

GTGAGATAGA TGAAAAGGTG ACTGATTTGG TGCAGTTTCT GCTCTTCAAG TATCAAATGA 780 

el _ AGGAGCCGAT CACAAAGGCA GAAATACTGG AGAGTGTCAT AAAAAATTAT GAAGACCACT 840 • 

65 TCCCTTTGTT GTTTAGTGAA GCCTCCGAGT GCATGCTGCT GGTCTTTGGC ATTGATGTAA 900 

AGGAAGTGGA TCCCACTGGC CACTCCTTTG TCCTTGTCAC CTCOCTGGGC CTCACCTATG 960 

ATGGGATGCT GAGTGATGTC CAGAGCATGC CCAAGACTGG CATTCTCATA CTTATCCTAA 1020 

GCATAATCTT CATAGAGGGC TACTGCACCC CTGAGGAGGT CATCTGGGAA GCACTGAATA 1080 

TGATGGGGCT GTATGATGGG ATGGAGCACC TCATTTATGG GGAGCCCAGG AAGCTGCTCA 1140 

70 CCCAAGATTG GGTGCAGGAA AACTACCTGG AGTACCGGCA GGTGCCTGGC AGTGATCCTG 1200 

CACGGTATGA GTTTCTGTGG GGTCCAAGGG CTCATGCTGA AATTAGGAAG ATGAGTCTCC 1260 

TGAAATTTTT GGCCAAGGTA AATGGGAGTG ATCCAAGATC CTTCCCACTG TGGTATGAGG 1320 

AGGCTTTGAA AGATGAGGAA GAGAGAGCCC AGGACAGAAT TGCCACCACA GATGATACTA 1380 

CTGCCATGGC CAGTGCAAGT TCTAGCGCTA CAGGTAGCTT CTCCTACCCT GAATAAAGTA 1440 

75 AGACAGATTC TTCACTGTGT TTTAAAAGGC AAGTCAAATA CCACATGATT TTACTCATAT 1500 

GTGGAATCTA AAAAAAAAAA AAAAAAAA 
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Seq ID NO: 149 Protein Sequence 
Protein Accession 8: NP_066386.1 

1 11 21 31 41 51 

5 | I | | I I 

MPRAPKRQRC MPEEDLQSQS ETQGLEGAQA PLAVEEDASS STSTSSSPPS SFPSSSSSSS 60 
SSCYPLIPST PEEVSADDET PNPPQSAQIA CSSPSWASL PLDQSDEGSS SQKEESPSTL 120 
QVLPDSESLP RSEIDEKVTD LVQFLLFKYQ MKEPITKABI LESVIKNYED HFPLLFSEAS 180 
ECMLLVFGID VKEVDPTGHS FVLVTSLGLT YDGMLSDVQS MPKTGILILI LSIIFIEGYC 240 
10 TPEEVIWEAL NMMGLYDGME HLIYGEPRKL LTQDWVQENY LEYRQVPGSD PARYEFLWGP 300 
RAHAEIRKMS LLKFLAKVNG SDPRSFPLWY EEALKDEEER AQDRIATTDD TTAMASASSS 360 
ATGSFSYPE 

Seq ID NOt 150 DNA Sequence 
15 Nucleic Acid Accession #: NM_0036S5 
Coding sequence: 12.. 398 

1 11 21 31 41 51 

on 1 1 1 1 1 1 

/U CGACATCAGA GATGAGGACA GCATTGCTGC TCCTTGCAGC CCTGGCTGTG GCTACAGGGC 60 

CAGCCCTTAC CCTGCGCTGC CACGTGTGCA CCAGCTCCAG CAACTGCAAG CATTCTGTGG 120 

TCTGCCCGGC CAGCTCTCGC TTCTGCAAGA CCACGAACAC AGTGGAGCCT CTGAGGGGGA 180 

ATCTGGTGAA GAAGGACTGT GCGGAGTCGT GCACACCCAG CTACACCCTG CAAGGCCAGG 240 

TCAGCAGCGG CACCAGCTCC ACCCAGTGCT GCCAGGAGGA CCTGTGCAAT GAGAAGCTGC 300 

25 ACAACGCTGC ACCCACCCGC ACCGCCCTCG CCCACAGTGC CCTCAGCCTG GGGCTGGCCC 360 

TGAGCCTCCT GGCCGTCATC TTAGCCCCCA GCCTGTGACC TTCCCCCCAG GGAAGGCCCC 420 

TCATGCCTTT CCTTCCCTTT CTCTGGGGAT TCCACACCTC TCTTCCCCAG CCGGCAACGG 480 

GGGTGCCAGG AGCCCCAGGC TGAGGGCTTC CCCGAAAGTC TGGGACCAGG TCCAGGTGGG 540 

CATGGAATGC TGATGACTTG GAGCAGGCCC CACAGACCCC ACAGAGGATG AAGCCACCCC 600 

30 ACAGAGGATG CAGCCCCCAG CTGCATGGAA GGTGGAGGAC AGAAGCCCTG TGGATCCCCG 660 

GATTTCACAC TCCTTCTGTT TTGTTGCCGT TTATTTTGTA CTCAAATCTC TACATGGAGA 720 
TAAATGATTT AAACC 

Seq ID NO: 151 Protein Sequence 
35 Protein Accession ftt NP_003686 

1 11 21 31 41 51 

I I I I I I 

MRTALLbLAA LAVATGPALT LRCHVCTSSS NCKHSWCPA SSRFCKTTNT VEPLRGNLVK 60 
40 KDCAESCTPS YTLQGQVSSG TSSTQCCQED LCNEKLHNAA PTRTALAHSA LSLGLALSLL 120 
AVILAPSL 

Seq ID NO: 152 DNA Sequence 
Nucleic Acid Accession 8: NMJJ06398.1 
45 Coding sequence: 19.. 516 

1 11 21 31 41 51 

i I I I I I 

GGCCCCTTGT CTGCAGAGAT GGCTCCCAAT GCTTCCTGCC TCTGTGTGCA TGTCCGTTCC 60 

50 GAGGAATGGG ATTTAATGAC CTTTGATGCC AACCCATATG ACAGOGTGAA AAAAATCAAA 120 

GAACATGTCC GGTCTAAGAC CAAGGTTCCT GTGCAGGACC AGGTTCTTTT GCTGGGCTCC 180 

AAGATCTTAA AGCCACGGAG AAGCCTCTCA TCTTATGGCA TTGACAAAGA GAAGACCATC 240 

CACCTTACCC TGAAAGTGGT GAAGCCCAGT GATGAGGAGC TGCCCTTGTT TCTTGTGGAG 300 

TCAGGTGATG AGGCAAAGAG GCACCTCCTC CAGGTGCGAA GGTCCAGCTC AGTGGCACAA 360 

55 GTGAAAGCAA TGATCGAGAC TAAGACGGGT ATAATCCCTG AGACCCAGAT TGTGACTTGC 420 

AATGGAAAGA GACTGGAAGA TGGGAAGATG ATGGCAGATT ACGGCATCAG AAAGGGCAAC 480 

TTACTCTTCC TGGCATCTTA TTGTATTGGA GGGTGACCAC CCTGGGGATG GGGTGTTGGC 540 

AGGGGTCAAA AAGCTTATTT CTTTTAATCT CTTACTCAAC GAACACATCT TCTGATGATT 600 

TCCCAAAATT AATGAGAATG AGATGAGTAG AGTAAGATTT GGGTGGGATG GGTAGGATGA 660 

60 AGTATATTGC CCAACTCTAT GTTTCTTTGA TTCTAACACA ATTAATTAAG TGACATGATT 720 
TTTACTAATG TATTACTGAG ACTAGTAAAT AAATTTTTAA GGCAAAATAG AGCATTC 



65 



75 



Seq ID NO: 153 Protein Sequence 
Protein Accession $t NP_006389.l 



1 11 21 31 41 51 

I I i I I I 

MAPNASCLCV HVRSEEWDLM TFDANPYDSV KKIKEHVRSK TKVPVQDQVL LLGSKILKPR 60 
RSLSSYGIDK EJCTIHLTLKV VKPSDEELPL FLVESGDEAK RHLLQVRRSS SVAQVKAMIE 120 
70 TKTGIIPETQ IVTCNGKRLE DGKMMADYGI RKGNLLFLAS YCIGG 



Seq ID NO: 154 DNA Sequence 
Nucleic Acid Accession 8 : BC017490.1 
Coding sequence: 74.. 2788 

1 11 21 31 41 51 

I I I I I I 

GTGGGTCACG TGAACCACTT TTCGCGCGAA ACCTGGTTGT TGCTGTAGTG GCGGAGAGGA 
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TCGTGGTACT GCTATGGCGG AATCATCGGA ATCCTTCACC ATGGCATCCA GCCCGGCCCA 120 

GCGTCGGCGA GGCAATGATC CTCTCACCTC CAGCCCTGGC CGAAGCTCCC GGCGTACTGA 180 

TGCCCTCACC TCCAGCCCTG GCCGTGACCT TCCACCATTT GAGGATGAGT CCGAGGGGCT 240 

CCTAGGCACA GAGGGGCCCC TGGAGGAAGA AGAGGATGGA GAGGAGCTCA TTGGAGATGG 300 

5 CATGGAAAGG GACTACCGCG CCATCCCAGA GCTGGACGCC TATGAGGCCG AGGGACTGGC 360 

TCTGGATGAT GAGGACGTAG AGGAGCTGAC GGCCAGTCAG AGGGAGGCAG CAGAGCGGGC 420 

CATGCGGCAG CGTGACCGGG AGGCTGGCOG GGGCCTGGGC CGCATGCGCC GTGGGCTCCT 480 

GTATGACAGC GATGAGGAGG ACGAGGAGCG CCCTGCCCGC AAGCGCCGCC AGGTGGAGCG 540 

GGCCACGGAG GACGGCGAGG AGGACGAGGA GATGATCGAG AGCATCGAGA ACCTGGAGGA 600 

10 TCTCAAAGGC CACTCTGTGC GCGAGTGGGT GAGCATGGCG GGCCCCCGGC TGGAGATCCA 660 

CCACCGCTTC AAGAACTTCC TGCGCACTCA CGTCGACAGC CACGGCCACA ACGTCTTCAA 720 

GGAGCGCATC AGCGACATGT GCAAAGAGAA CCGTGAGAGC CTGGTGGTGA ACTATGAGGA 780 

CTTGGCAGCC AGGGAGCACG TGCTGGCCTA CTTCCTGCCT GAGGCACCGG CGGAGCTGCT 840 

GCAGATCTTT GATGAGGCTG CCCTGGAGGT GGTACTGGCC ATGTACCCCA AGTACGACCG 900 

15 CATCACCAAC CACATCCATG TCCGCATCTC CCACCTGCCT CTGGTGGAGG AGCTGCGCTC 960 

GCTGAGGCAG CTGCATCTGA ACCAGCTGAT CCGCACCAGT GGGGTGGTGA CCAGCTGCAC 1020 

TGGCGTCCTG CCCCAGCTCA GCATGGTCAA GTACAACTGC AACAAGTGCA ATTTCGTCCT 1080 

GGGTCCTTTC TGCCAGTCCC AGAACCAGGA GGTGAAACCA GGCTCCTGTC CTGAGTGCCA 1140 

GTCGGCCGGC CCCTTTGAGG TCAACATGGA GGAGACCATC TATCAGAACT ACCAGCGTAT 1200 

20 CCGAATCCAG GAGAGTCCAG GCAAAGTGGC GGCTGGCCGG CTGCCCCGCT CCAAGGACGC 1260 

CATTCTCCTC GCAGATCTGG TGGACAGCTG CAAGCCAGGA GACGAGATAG AGCTGACTGG 1320 

CATCTATCAC AACAACTATG ATGGCTCCCT CAACACTGCC AATGGCTTCC CTGTCTTTGC 1380 

CACTGTCATC CTAGCCAACC ACGTGGCCAA GAAGGACAAC AAGGTTGCTG TAGGGGAACT 1440 

GACCGATGAA GATGTGAAGA TGATCACTAG CCTCTCCAAG GATCAGCAGA TCGGAGAGAA 1500 

25 GATCTTTGCC AGCATTGCTC CTTCCATCTA TGGTCATGAA GACATCAAGA GAGGCCTGGC 1560 

TCTGGCCCTG TTCGGAGGGG AGCCCAAAAA CCCAGGTGGC AAGCACAAGG TACGTGGTGA 1620 

TATCAACGTG CTCTTGTGCG GAGACCCTGG CACAGCGAAG TCGCAGTTTC TCAAGTATAT 1680 

TGAGAAAGTG TCCAGCCGAG CCATCTTCAC CACTGGCCAG GGGGCGTCGG CTGTGGGCCT 1740 

CACGGCGTAT GTCCAGCGGC ACCCTGTCAG CAGGGAGTGG ACCTTGGAGG CTGGGGCCCT 1800 

30 GGTTCTGGCT GACCGAGGAG TGTGTCTCAT TGATGAATTT GACAAGATGA ATGACCAGGA 1860 . 

CAGAACCAGC ATCCATGAGG CCATGGAGCA ACAGAGCATC TCCATCTCGA AGGCTGGCAT 1920 

CGTCACCTCC CTGCAGGCTC GCTGCAOGGT CATTGCTGCC GCCAACCCCA TAGGAGGGCG 1980 

CTACGACCCC TCGCTGACTT TCTCTGAGAA CGTGGACCTC ACAGAGCCCA TCATCTCACG 2040 

CTTTGACATC CTGTGTGTGG TGAGGGACAC CGTGGACCCA GTCCAGGACG AGATGCTGGC 2100 

35 CCGCTTCGTG GTGGGCAGCC ACGTCAGACA CCACCCCAGC AACAAGGAGG AGGAGGGGCT 2160 

GGCCAATGGC AGCGCTGCTG AGCCCGCCAT GCCCAACACG TATGGCGTGG AGCCCCTGCC 2220 

CCAGGAGGTC CTGAAGAAGT ACATCATCTA CGCCAAGGAG AGGGTCCACC CGAAGCTCAA 2280 

CCAGATGGAC CAGGACAAGG TGGCCAAGAT GTACAGTGAC CTGAGGAAAG AATCTATGGC 2340 

GACAGGCAGC ATCCCCATTA CGGTGCGGCA CATCGAGTCC ATGATCCGCA TGGCGGAGGC 2400 

40 CCACGCGCGC ATCCATCTGC GGGACTATGT ' GATCGAAGAC GACGTCAACA TGGCCATCCG 2460 

CGTGATGCTG GAGAGCTTCA TAGACACACA GAAGTTCAGC GTCATGCGCA GCATGGGCAA 2520 

GACTTTTGCC CGCTACCTTT CATTCCGGCG TGACAACAAT GAGCTGTTGC TCTTCATACT 2580 

GAAGCAGTTA GTGGCAGAGC AGGTGACATA TCAGCGCAAC CGCTTTGGGG CCCAGCAGGA 2640 

CACTATTGAG GTCCCTGAGA AGGACTTGGT GGATAAGGCT CGTCAGATCA ACATCCACAA 2700 

45 CCTCTCTGCA TTTTATGACA GTGAGCTCTT CAGGATGAAC AAGTTCAGCC ACGACCTGAA 2760 

AAGGAAAATG ATCCTGCAGC AGTTCTGAGG CCCTATGCCA TCCATAAGGA TTCCTTGGGA 2820 

TTCTGGTTTG GGGTGGTCAG TGCCCTCTGT GCTTTATGGA CACAAAACCA GAGCACTTGA 2880 

TGAACTCGGG GTACTAGGGT CAGGGCTTAT AGCAGGATGT CTGGCTGCAC CTGGCATGAC 2940 

TGTTTGTTTC TCCAAGCCTG CTTTGTGCTT CTCACCTTTG GGTGGGATGC CTTGCCAGTG 3000 

50 TGTCTTACTT GGTTGCTGAA CATCTTGCCA CCTCCGAGTG CTTTGTCTCC ACTCAGTACC 3060 

TTGGATCAGA GCTGCTGAGT TCAGGATGCC TGCGTGTGGT TTAGGTGTTA GCCTTCTTAC 3120 

ATGGATGTCA GGAGAGCTGC TGCCCTCTTG GCGTGAGTTG CGTATTCAGG CTGCTTTTGC 3180 

TGCCTTTGGC CAGAGAGCTG GTTGAAGATG TTTGTAATCG TTTTCAGTCT CCTGCAGGTT 3240 

TCTGTGCCCC TGTGGTGGAA GAGGGCACGA CAGTGCCAGC GCAGCGTTCT GGGCTCCTCA 3300 

55 GTCGCAGGGG TGGGATGTGA GTCATGCGGA TTATCCACTC GCCACAGTTA TCAGCTGCCA 3360 

TTGCTCCCTG TCTGTTTCCC CACTCTCTTA TTTGTGCATT CGGTTTGGTT TCTGTAGTTT 3420 
TAATTTTTAA TAAAGTTGAA TAAAATATAA AAAAAAAAAA AAAAAA 

Seq ID NO 5 155 Protein Sequence 
60 Protein Accession #: AAB17490.1 

1 11 21 31 41 51 

I i I 1 I I 

MAESSESFTM ASSPAQRRRG NDPLTSSPGR SSRRTDALTS SPGRDLPPFE DESEGLLGTE 60 

65 GPLEEEEDGE ELIGDGMERD YRAIPELDAY EAEGUVLDDE DVEELTASQR EAAERAMRQR 120 

DREAGRGLGR MRRGLLYDSD EEDEERPARX RRQVERATED GEEDEEMIES IENLEDLKGH 130 

SVREWVSMAG PRLEIHHRPK NFLRTHVDSH GHNVFKERIS DMCKENRESL WNYEDLAAR 240 

EHVLAYFLPE APAELLQIFD EAALEWLAM YPKYDRITNH IHVRISKLPL VEELRSLRQL 300 

HLNQLIRTSG WTSCTGVLP QLSMVKYNCN KCNPVLGPFC QSQNQEVKPG SCPECQSAGP 360 

70 FBVNMEETIY QNYQRIRIQE SPGKVAAGRL PRSKDAILLA DLVDSCKPGD EIELTGIYHN 420 

NYDGSLNTAN GFPVFATVIL ANHVAKKDNK VAVGELTDED VKMITSLSKD QQIGEKIFAS 480 

IAPSIYGHED IKRGLALALF GGEPKNPGGK HKVRGDINVL LCGDPGTAKS QFLKYIEKVS 540 

SRAIFTTGQG ASAVGLTAYV QRHPVSREWT LEAGALVLAD RGVCLIDEFD KMNDQDRTSI 600 

HEAMEQQSIS ISKAGIVTSL QARCTVIAAA NPIGGRYDPS LTFSENVDLT EPIISRFDIL 660 

75 CWRDTVDPV QDEMLARFW GSHVRHHPSN KEEEGLANGS AAEPAMPNTY GVEPLPQEVL 720 

KKYIIYAKER VHPKLNQMDQ DKVAKMYSDL RKESMATGSI PITVRHIESM IRMAEAHARI 780 

HLRDYVIEDD VNMAIRVMLE SFIDTQKPSV MRSMRKTFAR YLSFRRDNNE LLLFXLKQLV 840 

AEQVTYQRNR FGAQQDTIEV PEKDLVDKAR QINIHNLSAF YDSELFRMNK FSHDLKRKMI 900 
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W3QF 

Seq ID MO; 156 DMA Sequence 
Nucleic Acid Accession #: NM_002497.1 
5 Coding sequence: 135.. 1472 

1 11 21 31 41 51 

I I I I I I 

GGCACGAGTA GGGGTGGCGG GTCAGTGCTG CTCGGGGGCT TCTCCATCCA GGTCCCTGGA 60 

10 GTTCCTGGTC CCTGGAGCTC CGCACTTGGC GCGCAACCTG CGTGAGGCAG CGCGACTCTG 120 

GCGACTGGCC GGCCATGCCT TCCCGGGCTG AGGACTATGA AGTGTTGTAC ACCATTGGCA 180 

CAGGCTCCTA CGGCCGCTGC CAGAAGATCC GGAGGAAGAG TGATGGCAAG ATATTAGTTT 240 

GGAAAGAACT TGACTATGGC TCCATGACAG AAGCTGAGAA ACAGATGCTT GTTTCTGAAG 300 

TGAATTTGCT TCGTGAACTG AAACATCCAA ACATCGTTCG TTACTATGAT CGGATTATTG 360 

15 ACCGGACCAA TACAACACTG TACATTGTAA TGGAATATTG TGAAGGAGGG GATCTGGCTA 420 

GTGTAATTAC AAAGGGAACC AAGGAAAGGC AATACTTAGA TGAAGAGTTT GTTCTTCGAG 480 

TGATGACTCA GTTGACTCTG GCCCTGAAGG AATGCCACAG ACGAAGTGAT GGTGGTCATA 540 

CCGTATTGCA TCGGGATCTT AAACCAGCCA ATGTTTTCCT GGATGGCAAG CAAAACGTCA 600 

AGCTTGGAGA CTTTGGGCTA GCTAGAATAT TAAACCATGA CACGAGTTTT GCAAAAACAT 660 

20 TTGTTGGCAC ACCTTATTAC ATGTCTCCTG AACAAATGAA TCGCATGTCC TACAATGAGA 720 

AATCAGATAT CTGGTCATTG GGCTGCTTGC TGTATGAGTT ATGTGCATTA ATGCCTCCAT 780 

TTACAGCTTT TAGCCAGAAA GAACTCGCTG GGAAAATCAG AGAAGGCAAA TTCAGGCGAA 840 

TTCCATACCG TTACTCTGAT GAATTGAATG AAATTATTAC GAGGATGTTA AACTTAAAGG 900 

ATTACCATCG ACCTTCTGTT GAAGAAATTC TTGAGAACCC TTTAATAGCA GATTTGGTTG 960 

25 CAGACGAGCA AAGAAGAAAT CTTGAGAGAA GAGGGCGACA ATTAGGAGAG CCAGAAAAAT 1020 

CGCAGGATTC CAGCCCTGTA TTGAGTGAGC TGAAACTGAA GGAAATTCAG TTACAGGAGC 1080 

GAGAGCGAGC TCTCAAAGCA AGAGAAGAAA GATTGGAGCA GAAAGAACAG GAGCTTTGTG 1140 

TTCGTGAGAG ACTAGCAGAG GACAAACTGG CTAGAGCAGA AAATCTGTTG AAGAACTACA 1200 

GCTTGCTAAA GGAACGGAAG TTCCTGTCTC TGGCAAGTAA TCCAGAACTT CTTAATCTTC 1260 

30 CATCCTCAGT AATTAAGAAG AAAGTTCATT TCAGTGGGGA AAGTAAAGAG AACATCATGA 1320 

GGAGTGAGAA TTCTGAGAGT CAGCTCACAT CTAAGTCCAA GTGCAAGGAC CTGAAGAAAA 1380 

GGCTTCACGC TGCCCAGCTG CGGGCTCAAG CCCTGTCAGA TATTGAGAAA AATTACCAAC 1440 

TGAAAAGCAG ACAGATCCTG GGCATGCGCT AGCCAGGTAG AGAGACACAG AGCTGTGTAC 1500 

AGGATGTAAT ATTACCAACC TTTAAAGACT GATATTCAAA TGCTGTAGTG TTGAATACTT 1560 

35 GGCCCCATGA GCCATGCCTT TCTGTATAGT ACACATGATA TTTCGGAATT GGTTTTACTG 1620 

TTCTTCAGCA ACTATTGTAC AAAATGTTCA CATTTAATTT TTCTTTCTTC TTTTAAGAAC 1680 

ATATTATAAA AAGAATACTT TCTTGGTTGG GCTTTTAATC CTGTGTGTGA TTACTAGTAG 1740 

GAACATGAGA TGTGACATTC TAAATCTTGG GAGAAAAAAT AATATTAGGA AAAAAATATT 1800 

TATGCAGGAA GAGTAGCACT CACTGAATAG TTTTAAATGA CTGAGTGGTA TGCTTACAAT 1860 

40 TGTCATGTCT AGATTTAAAT TTTAAGTCTG AGATTTTAAA TGTTTTTGAG CTTAGAAAAC 1920 

CCAGTTAGAT GCAATTTGGT CATTAATACC ATGACATCTT GCTTATAAAT ATTCCATTGC 1980 

TCTGTAGTTC AAATCTGTTA GCTTTGTGAA AATTCATCAC TGTGATGTTT GTATTCTTTT 2040 

TTTTTTTCTG TTTAACAGAA TATGAGCTGT CTGTCATTTA CCTACTTCTT TCCCACTAAA 2100 
TAAAAGAATT CTTCAGTTA 



45 



Seq ID NO: 157 Protein Sequence 
Protein Accession #: NP 002488.1 



1 11 21 31 41 51 

50 | | | | II 

MPSRAEDYEV LYTIGTGSYG RCQKIRRKSD GKILVWKELD YGSMTEAEKQ MLVSEVNLLR 60 

ELKHPNIVRY YDRIIDRTNT TLYIVMEYCE GGDLASVITK GTKERQYLDE EFVLRVMTQL 120 

TLALKECHRR SDGGHTVLHR DLKPANVFLD GKQNVKLGDF GLARILNHDT SFAKTFVGTP 180 

YYMSPEQMNR MSYNEKSDIW SLGCLLYELC ALMPPFTAFS QKELAGKIRE GKFRRIPYRY 240 

55 SDELNEIITR MLNLKDYHRP SVEEILENPL IADLVADEQR RNLERRGRQL GEPEKSQDSS 300 

PVLSELKLKE IQLQERERAL KAREERLEQK EQFJjCVRERL AEDKLARAEN LLKNYSL.LKE 360 

RKFLSLASNP ELLNLPSSVI KKKVHFSGES KEN IKR SENS ESQLTSKSKC KDLKKRLHAA 420 
QLRAQALSDI EKNYQLKSRQ ILGMR 

60 Seq ID NO: 158 DNA Sequence 

Protein Accession &: Eos sequence 

1 11 21 31 41 51 

..III I I' 

03 TGAGTTTGCC CCCTTACCCC CATCCCAGTG AATATTTGCA ATTCCTAAAG ACGTGTTTTG 60 

ATTGTCACAC CTGGGTGGGG AACATGCTAC TGGCATCTAA TGCATAGAGG GCAGTAATGC 120 

TGCTAAACAT CTTTCAACGC ACAGGACAGA GCCCCACAAA AGAGAATTAT CTAGCCCCAA 180 

ATGTCCATAA CACTGCTGTT GAGAAAACCT ACCGCAGGAT CTTACTGGGC TTCATAGGTA 240 

AGCTTGCCTT TGTTCTGGCT TCTGTAGATA TATAAAATAA AGACACTGCC CAGTCCCTCC 300 

70 CTCAACGTCC CGAGCCAGGG CTCAAGGCAA TTCCAATAAC AGTAGAATGA ACACTAAATA 360 

TTGATTTCAA AATCTCAGCA ACTAGAAGAA TGACCAACCA TCCTGGTTGG CCTGGGACTG 420 

TCCTAGTTTT AGCATTGAAA GTTTCAGGTT CCAGGAAAGC CCTCAGGCCT GGGCTGCTGG 480 

TCACCCTAGC AGCTGAGGGA CTCTTCAATA CAGAATTAGT CTTTGTGCAC TGGAGATGAA 540 

TATACTTTAA TTTGTAACAT GTGAAAACAT CTATAAACAT CTACTGAAGC CTGTTCTGTC 600 

75 TGCACCGACA TTTTCATTGA GTACGGATTC TTCCTACCAG ATACAGCTGC TCTACAACTT 660 

TCGAGGGCTG GTATAAAACT AGCTTTTACC TATTTTTAAA AATTACATGA ATAGTAAAAA 720 

CTTGGATTAA CCCAGTATTC GGGTATTTTC AATTTCCTTG GGAGCTTAGA GGACGGACAA 780 

ATAAAAAGAT TATTTCAACA TCAAATATAY GCTATTGTTT ACATATGAAG ATAACCACAT 840 
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ATATGTATAA ATTCACCGTT ACTTTTTAGC AATACTATAA AATCCAACAG AAAAAAATAG 
CATTTACTAT 

Seq ID NO: 15 9 DNA Sequence 

Nucleic Acid Accession ft: Eos sequence 

Coding sequence: 264.. 782 



900 



CCCTGCTCCA 
TCATCGCGGG 
TTCCTCAGAC 
TGCCGGTGGC 
AACTTATCAG 
TGATCGCGGT 
TCAGCAGGCA 
CCAGGCCCAG 
CAGAGAGAGA 
ATAGCTTGGA 
ACACACAAGT 
AGAACATAAA 
TCTGGTATTT 
GAATTCCAAA 
GGAAATGCCA 
CAAAAAGAAT 

AAAACAGGCC 
TAAAGTATGA 
AAATTTTAAA 
TGACTTCCCT 
AAAATAGCAA 



11 
I 

GTCACACCCG 
ACTAATTTTC 
TGAGAACTGT 
ACTACTGAGA 
CAAGGAGCTC 
GGTGTCCCTC 
CATGGCCCAC 
CCCTGGCCAC 
CATCCCAATG 
TAGCTCCTGC 
CGTCTTTTCT 
GGAAATCACA 
TGTCAACCCT 
TATTTTTAAT 
TTTTTCCCCC 
GTGGAGAAGA 
ACCCAAAAAA 
AATGCCCCGG 
GAACTATGGA 
AAGGTTGAAT 
TTGAGCTCTT 
CCACCACCA 



21 
I 

GAAGCTGACT 
CTTAAAATTT 
TTCCAGTATA 
GACGAGGTGC 
ATCATGCTGA 
TTCCTGCAGG 
AAGAGTGAAC 
CATCATCCAC 
TCTGATTCCC 
AGTTCGCCTC 
GACCCTGGAG 
GATTATGTCA 
GCTCTGTCTG 
GGGGTCCAGT 
TTAAACAAGG 
AAACTGATAA 
GCTGTTCGTT 
CAAGAAAGGT 
GTCCATCAGC 
CAGCTGTTGT 
TAATTATTGG 



31 
I 

GGTCCACGCA 
AGACTTGCAC 
TACATCAAGT 
CAGGGTGGTT 
CAGAAGTCAT 
CCTGCTTCCT 
AGATACTGAA 
CTGCTGTCAA 
TTTACAGGCA 
CTGCCTGCCA 
AACTAAAAAA 
ATGTCAATCC 
AGCCAGCGGA 
TCTCTATGGA 
CATGGGGCTC 
ATACACAGAG 
CCTCAAAAAC 
TGAGATCAGA 
AGAGATAGTA 
AGAGTTCTAT 
CAATAAACAA 



41 
I 

CAGCTGAAGC 
AGTAAGGACT 
CACTGAGATC 
CCTGAAAGTG 
GGAGGTCTGG 
CACCGCCATC 
AGCGGCCAGT 
AGAGATGAAG 
TGACAGCGAC 
GGCCACAGAG 
TGACTCCCCG 
AGAAAGACAC 
ATATGATCAA 
TTCTTACATT 
ACAAGTCTAT 
GTCCTCAAGA 
AAAAACAAGG 
TGTTAGGAAG 
GTGAAGTCTC 
TTGGCAATCT 
CTTCTTTAAA 



51 
I 

ATGAGGAAAC 
TCAACTGACC 
TCCAGCACCC 
CCTGAGCCCC 
CATGGCTTAG 
AACTACCTGC 
CTCCAGGTTC 
GAGACTCAGA 
ACACCCTCAG 
GATGTGGATT 
CTGGACTATG 
AAGCCCAGTT 
GTGGCCATGT 
TAATTTGTAG 
GGAGACAGGC 
CCCATGGACT 
CTTGGCTGGG 
AACTTTCAGG 
TCCCCAGGGA 
CATGGTTAAA 
AGTTTTAAAT 



Seq ZD NO: 160 Protein Sequence 
Protein Accession ft: Eos sequence 

1 11 21 31 41 51 

I I I I I 1 

MLTEVMEVWH GLVIAWSLF LQACPLTAIN YLLSRHMAHK SEQILKAASL QVPRPSPGHH 

HPPAVKEMKE TQTERDIPMS DSLYRHDSDT PSDSLDSSCS SPPACQATED VDYTQWFSD 

PGELKNDSPL DYENIKEITD YVNVNPERHK PSFWYPVNPA LSEPAEYDQV AM 

Seq ID NO: 161 DNA Sequence 
Nucleic Acid Accession ft: NMJJ12152 
Coding sequence: 43.. 1104 



CTTCTTTAAA 
GACAAGCACA 
GGAACAAAGC 
TCTAATTCTC 
TACCTGTTGG 
ATGTTTAACA 



AGGCACATGT 
CTGCTCATTT 
TGGAATTGCC 
TACCTTGTTT 
CTGCGGATCT 
TCCATCAGCC 
GCGTTTGTGG 
AGGCAGTGTG 
GTCGTGAACC 
ATGATCTGCT 
GTCCTCAGCA 
GTCTGCAATA 
GTCTTAGG 



11 
I 

TTTCTTTCTA 
TGGACTTTTT 
TTGTGATTGT 
TGGTCATCGC 
CTAATTTAGC 
CAGGCCCAGT 
ACAGTAGCTT 
CAATCATGAG 
TGCTTGTCTG 
TCTGCAACAT 
TCTGGACAGT 
ACGTGTACGT 
GCCGGAGGAC 
TATGCTGGAC 
GCGTGCAGCA 
CCATCATCTA 
GCTTCTCTCA 
GGAGTGACAC 
AAAGCACTTC 



21 
I 

GGATGTTCAC 
TTATAATAGG 
TTTGTGTGTT 
GGCAGTGATC 
TGCTGCCGAT 
TTCAAAAACT 
GACTGCTTCC 
GATGCGGGTC 
GGCCATCGCC 
CTCTGCCTGC 
GTCCAACCTC 
CAAGAGGAAA 
ACCCATGAAG 
CCCGGGCCTG 
TGTGAAAAGG 
CTCCTACAAG 
GGAGAACCCA 
AGGCAGCCAG 
CTAAACTCTG 



31 
I 

TTCTTCTCCA 
AGCAACACTG 
GGGACGTTTT 
AAAAACAGAA 
TTCTTCGCTG 
TTGACTGTCA 
CTCACCAACT 
CATAGCAACC 
ATTTTTATGG 
TCTTCCCTGG 
ATGGCCTTCC 
ACCAACGTCT 
CTAATGAAGA 
GTGGTTCTGC 
TGGTTCCTGC 
GACGAGGACA 
GAGAGGOGTC 
TACATAGAGG 
GATGCCTCTC 



CAATGAATGA 
ATACTGTCGA 



AATTTCATTT 
GAATTGCCTA 
ACCGCTGGTT 
TGCTGGTTAT 
TGACCAAAAA 
GGGCGGTCCC 
CCCCCATTTA 
TCATCATGGT 
TGTCTCCGCA 
GGGTGATGAC 
TCCTOGACGG 
TGCTGGCGCT 
TGTATGGCAC 
CCTCTCGCAT 
ATAGTATTAG 
GGCCCACCCA 



51 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
790 
840 
900 
960 
1020 
1080 
,1140 
1200 
1260 



60 
120 



51 
I 

GTGTCACTAT 
TGACTGGACA 
TATTTTTTTT 
CCCCTTCTAC 
TGTATTCCTG 
TCTCCGTCAG 
CGCCGTGGAG 
GAGGGTGACA 
CACACTGGGC 
CAGCAGGAGT 
TGTGGTGTAC 
TACAAGTGGG 
TGTCrTAGGG 
CCTGAACTGC 
GCTCAACTCC 
CATGAAGAAG 
CCCCTCCACA 
CCAAGGTGCA 
GGTGATGACT 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 



Seq ID NO: 162 Protein Sequence 
Protein Accession ft: NP_036284 

1 11 21 31 41 

I I I I I I 

MNECHYDKHM DFPYNRSNTD TVDDWTGTKL VIVLCVGTPP CLFIFFSNSL VIAAVIKNRK 
FHFPFYYLLA NLAAADFFAG IAYVFLMFNT GPVSKTLTVN RWFLRQGLLD SSLTASLTNI* 
LVIAVERHMS IMRMRVHSNL TKKRVTLLIL LVWAIAIFMG AVPTLGWNCL CNISACSSLA 
PIYSRSYLVF WTVSNLMAFli IMWVYLRIY VYVKRKTNVL SPHTSGSISR RRTPMKLMKT 
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VMTVLGAFW CWTPGLWLL LDGLMCRQCG VQHVKRWPLL LALLNSWNP IIYSYKDEDM 300 
YGTMKKMICC FSQENPERRP SRIPSTVLSR SDTGSQYIED SISQGAVCNK STS 



15 



J Seq ID NO: 163 DNA Sequence 

Nucleic Acid Accession fh NM_020242 
Coding sequence: 72.. 4240 

1 11 21 31 41 51 

10 | I I I i I 

CAGTCGCGCG CGGTGCAGTC GGGAGGTGGA GGCACCGGCT GCATTGTTTT CGGGATCGAG 60 

GGGTGAGGGC GCTATGGCAC CCGGCTGCAA AACTGAGTTA CGCAGCGTGA CAAATGGTCA 120 

GTCTAACCAA CCAAGTAATG AAGGTGATGC CATCAAAGTT TTTGTGCGAA TTCGTCCTCC 180 

TGCAGAAAGA TCTGGGTCAG CTGATGGAGA GCAGAACTTA TGCTTATCTG TGCTGTCCTC 240 

CACGAGTCTC CGGCTGCACT CCAACCCTGA GCCCAAGACC TTCACGTTTG ATCATGTTGC 300 

AGATGTGGAT ACCACTCAGG AATCTGTATT TGCAACTGTG GCTAAAAGCA TTGTGGAGTC 360 

TTGCATGAGC GGTTATAATG GTACCATCTT TGCATATGGA CAGACTGGCT CAGGGAAGAC 420 

ATTTACTATG ATGGGACCAT CTGAATCTGA TAATTTTTCT CATAACCTGA GAGGAGTAAT 4 80 

CCCACGAAGT TTTGAATATT TGTTTTCCTT AATTGATCGT GAAAAAGAAA AGGCTGGAGC 540 

20 TGGAAAGAGT TTCCTTTGTA AGTGTTCCTT TATTGAAATC TACAACGAGC AGATATATGA 600 

TCTACTGGAC TCTGCATCGG CTGGACTGTA CTTAAGGGAG CATATCAAGA AGGGAGTCTT 660 

TGTTGTTGGT GCGGTGGAGC AGGTGGTAAC CTCAGCTGCT GAAGCCTATC AGGTGCTGTC 720 

TGGAGGATGG AGGAATAGAC GTGTGGCATC AACATCAATG AACAGAGAAT CGTCTAGGTC 780 

TCATGCCGTC TTTACAATTA CAATAGAGTC AATGGAGAAA AGTAATGAGA TTGTGAATAT 840 

ACGGACCrCC CTACTCAACC TGGTGGATTT AGCAGGATCT GAAAGGCAAA AAGATACCCA 900 

TGCAGAAGGG ATGAGATTGA AGGAAGCAGG TAACATAAAT CGATCATTGA GCTGCCTGGG 960 

CCAAGTGATT ACAGCACTTG TCGACGTGGG TAATGGAAAA CAGAGACATG TTTGCTACAG 1020 

AGACTCCAAA CTTACCTTCT TACTACGGGA TTCCCTTGGA GGTAATGCCA AAACAGCCAT 1080 

AATTGCAAAT GTTCATCCTG GATCCAGGTG TTTTGGGGAA ACCCTATCAA CACTTAACTT 1140 

TGCTCAAAGA GCCAAGCTGA TTAAAAACAA GGCAGTAGTA AATGAAGACA CCCAAGGAAA 1200 

TGTGAGCCAG CTCCAAGCTG AAGTGAAGAG GCTCAAAGAA CAACTGGCGG AGCTTGCTTC 1260 

AGGACAGACA CCACCAGAAA GCTTCCTGAC CAGAGACAAA AAGAAGACTA ACTATATGGA 1320 

GTATTTCCAG GAAGCAATGT TATTCTTTAA GAAATCTGAA CAGGAAAAGA AGTCTCTGAT 1380 

AGAAAAAGTT ACCCAATTAG AAGACCTCAC CCTCAAAAAG GAAAAATTTA TTCAATCTAA 1440 

TAAAATGATT GTGAAATTCC GAGAGGATCA AATAATACGC TTGGAAAAGC TCCACAAGGA 1500 

ATCCCGGGGA GGTTTTCTGC CTGAGGAGCA GGATCGTTTG CTCTCAGAAT TAAGGAATGA 1560 

GATTCAAACT CTGCGAGAAC AAATAGAGCA CCACCCCAGA GTTGCAAAGT ATGCTATGGA 1620 

AAATCATTCC CTCAGGGAGG AGAATAGAAG ACTGAGATTA TTAGAGCCTG TGAAAAGAGC 1680 

TCAAGAAATG GATGCCCAGA CCATTGGAAA ACTAGAAAAA GCTTTCTCTG AAATAAGTGG 1740 

CATGGAGAAA AGTGACAAAA ATCAGCAAGG ATTTTCACCT AAAGCTCAGA AAGAGCCATG 1800 

TTTGTTTGCA AACACTGAGA AGTTAAAAGC ACAACTCCTG CAAATTCAGA CAGAGCTGAA 1860 

TAATTCAAAG CAAGAATATG AAGAATTCAA AGAACTTACT AGGAAAAGGC AGCTAGAATT 1920 

GGAATCAGAG CTTCAGTCTT TGCAAAAAGC GAACCTTAAT CTTGAAAACC TTTTGGAAGC 1980 

AACAAAAGCC TGCAAGCGGC AAGAAGTTTC TCAGCTGAAT AAAATTCATG CTGAAACACT 2040 

TAAGATTATA ACTACACCAA CCAAGGCCTA CCAACTTCAT TCCCGACCAG TACCAAAATT 2100 

AAGCCCTGAA ATGGGAAGCT TTGGCTCTCT ATACACTCAG AATTCTAGCA TATTAGATAA 2160 

TGATATATTA AATGAGCCAG TTCCTCCTGA GATGAATGAA CAAGCTTTTG AGGCCATTTC 2220 

TGAAGAGCTT AGAACAGTGC AGGAACAAAT GAGTGCTCTT CAAGCCAAAC TGGATGAAGA 2280 

AGAGCATAAA AACCTAAAGC TTCAGCAGCA TGTTGACAAA CTGGAACATC ATTCTACCCA 2340 

AATGCAGGAG CTTTTCTCAT CAGAAAGAAT TGATTGGACC AAACAGCAGG AAGAGCTTCT 2400 

CTCACAGTTG AATGTCCTTG AAAAGCAGCT TCAAGAGACT CAAACTAAAA ATGACTTTTT 2460 

GAAAAGTGAG GTACATGACC TGCGAGTAGT CCTTCATTCT GCTGACAAGG AGCTTTCTTC 2520 

AGTGAAATTG GAATATAGTT CATTCAAAAC GAATCAGGAG AAAGAATTCA ACAAACTTTC 2580 

TGAAAGACAC ATGCATGTAC AGCTTCAATT AGATAATCTC AGGTTAGAAA ACGAAAAGCT 2640 

GCTTGAGAGC AAAGCCTGCC TACAGGATTC CTATGACAAC TTACAAGAAA TAATGAAATT 2700 

TGAGATTGAC CAACTTTCAA GAAACCTCCA AAACTTCAAA AAAGAAAATG AAACTCTGAA 2760 

ATCTGATCTG AATAATTTGA TGGAGCTTCT TGAGGCAGAA AAAGAACGCA ATAACAAATT 2820 

ATCATTACAG TTTGAAGAAG ATAAAGAAAA CAGTTCTAAA GAAATCTTAA AAGTTCTTGA 2880 

GGCTGTACGT CAGGAGAAAC AGAAAGAGAC GGCCAAGTGT GAGCAGCAGA TGGCAAAAGT 2940 

ACAGAAACTA GAAGAGAGCT TGCTTGCTAC TGAAAAAGTG ATCAGTTCCC TGGAAAAGTC 3000 

TAGAGATTCT GATAAGAAAG TTGTAGCTGA CCTCATGAAC CAGATCCAGG AGCTAAGAAC 3060 

ATCGGTCTGT GAGAAAACAG AAACTATAGA CACCCTGAAA CAAGAACTGA AGGACATAAA 3120 

TTGCAAATAC AACTCTGCTT TGGTTGACAG AGAAGAGAGC AGAGTGTTGA TCAAGAAGCA 3180 

GGAAGTGGAT ATTCTGGATC TGAAAGAAAC CCTTAGGCTG AGAATACTTT CTGAGGACAT 3240 

AGAGAGGGAT ATGCTCTGTG AGGACCTGGC TCATGCCACT GAGCAGCTGA ACATGCTCAC 3300 

AGAGGCCTCA AAAAAACACT CGGGGCTGCT GCAQTCTGCC CAGGAAGAAC TGACCAAGAA 3360 

GGAAGCCCTG ATTCAGGAAC TTCAGCACAA GCTAAACCAA AAGAAAGAGG AAGTAGAACA 3420 

GAAGAAGAAT GAATATAACT TCAAAATGAG GCAACTAGAA CATGTGATGG ATTCTGCTGC 3480 

TGAGGATCCC CAGAGTCCTA AGACACCACC TCACTTTCAA ACACATTTGG CAAAACTCCT 3540 

GGAAACACAA GAACAAGAGA TAGAAGATGG AAGAGCCTCT AAGACTTCTT TGGAACACCT 3600 

TGTAACAAAG CTAAATGAAG ACAGAGAAGT CAAAAATGCT GAAATCCTCA GAATGAAGGA 3660 

GCAGTTGCGT GAAATGGAAA ACCTACGCCT GGAAAGTCAG CAGTTAATAG AGAAAAACTG 3720 

GCTCCTGCAA GGTCAGCTGG ATGATATTAA AAGACAAAAG GAAAACAGTG ATCAGAATCA 3780 

TCCAGATAAT CAACAGCTGA AGAATGAACA AGAAGAAAGT ATCAAAGAAA GACTTGCAAA 3840 

75 AAGTAAAATA GTTGAAGAAA TGCTGAAAAT GAAAGCAGAC CTAGAAGAAG TCCAAAGTGC 3900 

CCTTTACAAC AAAGAGATGG AATGCCTTAG AATGACTGAT GAAGTCGAAC GAACCCAAAC 3960 

TTTGGAGTCT AAAGCATTCC AGGAAAAAGA ACAACTGAGA TCAAAGCTGG AAGAAATGTA 4020 

TGAAGAAAGA GAGAGAACAT CCCAGGAGAT GGAAATGTTA AGGAAGCAGG TGGAGTGTCT 4080 
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TGCTGAGGAA AATGGAAAGT TGGTAGGTCA CCAAAATTTG CATCAGAAGA TTCAGTACGT 4140 

AGTGCGACTA AAGAAGGAAA ATGTCAGGCT TGCTGAGGAG ACAGAAAAGT TGCGTGCCGA 4200 

AAATGTATTT TTAAAAGAAA AGAAAAGAAG TGAATCTTGA GGATTCCGGT CAGCTACCTA 4260 

GGCATCACCT TGTTTGAAGA TGTTTCTTCT CTTTTACAAG TAAGACCTAC TCCTGGCCAC 4320 

5 TTAGGAGAGC TGAATTTATG GACCTTAATT ATTAAATGTT TATAAGGTGG TGGTAACCAC 4380 

CTCAAGTTTC TGATGAACAT TCTGCATCCA TATACACCCT GTGACAGTCA GCAGTCTGCT 4440 

ATTAAGTGGC CTACTTCAAG GCTTTGAATC AACTTAAGGG AAAACCTTTT GTCTTTGTAA 4500 

AAATAAAAGC CTGTAGCTAA GGTTTACAGT GGACATTAGC CAGATCATTT TCTTCTTAGA 45S0 

TTATGCCATA ATCTCCTTTG ATTCTTATGG AAGTTCTAAC AATATATGGT GGTTCCAACA 4620 

10 CCTGCAGTGA GTTTAATGAC TGACTTAGTA GCAGGTACAA GAAGCAAACT TGTTAATATA 4680 

GATTATTTTT GTATTCTTAC TTTAGGTATT TTACTTGAGC ATTTTCCATG ACTGTAAATA 4740 
AAGCCATTTT TTAAGATAAA AAAAAAAAAA AAAAA 

Seq ID NO: 164 Protein Sequence 
15 Protein Accession fl: NPJ)64627 

1 11 21 31 41 51 

I I I I I I 

MAPGCKTELR SVTNGQSNQP SNBGDAIKVF VRIRPPAERS GSADGEQNLC LSVLSSTSLR 60 

20 LHSNPEPKTF TFDHVADVDT TQESVFATVA KSIVESCMSG YNGTIFAYGQ TGSGKTFTMM 120 

GPSESDNPSH NLRGVIPRSF EYLFSXjIDRE KEKAGAGKSF LCKCSFIEIY NEQIYDLLDS 180 

ASAGLYLREH IKKGVFWGA VEQWTSAAE AYQVLSGGWR NRRVASTSMN RESSRSHAVF 240 

TITIESMEKS NEIVNIRTSL LNLVDLAGSE RQKDTHAEGM RLKEAGNINR SLSCLGQVIT 300 

ALVDVGNGKQ RHVCYRDSKIi TFLLRDSLGG NAKTA1 IANV HPGSRCFGET LSTLNFAQRA 360 

25 KLIKNKAWN EDTQGNVSQL QAEVKRLKEQ LAELASGQTP PESFLTRDKK KTNYMEYFQE 420 

AMLFFKKSEQ EKKSLIEKVT QLEDLTLKKE KFIQSNKMIV KFREDQIIRL EKLHKESRGG 480 

FLPEEQDRLL SELRNEIQTIj REQIEHHPRV AKYAMENHSL REENRRLRLL EPVKRAQEMD 540 

AQTIAKLEKA FSEISGMEKS DKNQQGFSPK AQKEPCLFAN TEKLKAQLLQ IQTELNNSKQ 600 

EYEEFKELTR KRQLELESEL QSLQKANLNL ENLLEATKAC KRQEVSQLNK IHAETLKIIT 660 

30 TPTKAYQLHS RPVPKLSPEM GSFGSIjYTQN SSILDNDILN EPVPPEMNEQ AFEAISEELR 720 

TVQEQMSALQ AKLDEEEHKN LKLQQHVDKL EHHSTQMQEL FSSER1DWTK QQEELLSQLN 780 

VLEKQLQETQ TKNDFLKSEV HDLRWLHSA DKELSSVKLE YSSFKTNQEK EFNKLSERHM 840 

HVQLQliDNLR LENEKLLESK ACLQDSYDNL QEIMKFEIDQ LSRNLQNFKK ENETLKSDLN 900 

NLMELLEAEK ERNNKLSLQF EEDKENSSKE ILKVLEAVRQ EKQKETAKCE QQMAKVQKLE 960 

35 ESLLATEKVI SSLEKSRDSD KKWADLMNQ IQELRTSVCE KTETIDTLKQ ELKDINCKYN 1020 

SALVDREESR VLIKKQEVDI LDLKETLRLR ILSEDIERDM LCEDLAHATE QLNMLTEASK 10 B0 

KHSGLLQSAQ EELTKKEALI QELQHKLNQK KEEVEQKKNE YNFKMRQLEH VMDSAAEDPQ 1140 

SPKTPPHFQT HLAKLLETQE QEIEDGRASK TSLERLVTKL NEDREVKNAE ILRMKEQLRE 1200 

MENLRDESQQ LIEKNWLLQG QLDDIKRQKE NSDQNHPDNQ QLKNEQEESI KERLAKSKIV 1260 

40 EEMLKMKADL EEVQSALYNK EMECLRMTDE VERTQTLESK AFQEKEQLRS KLEEMYEERE 1320 

RTSQEMEMLR KQVECLAEEN GKLVGHQNLH QKIQYWRLK KENVRLAEET EKLRAENVFL 1380 



65 



Seq ID NO: 165 DNA Sequence 
45 Nucleic Acid Accession #: CAT cluster 

1 11 21 31 41 51 

i I I I I I 

r r m - n rrr ttttttcaca tgctgatgtt gctgccgatg atttcaacgc ctggctttga so 

50 gattccgtga gtagtcttga ataatttaaa attcgaaaat CAAATTCTAC TTATTTTCTC 120 

TTAATGCTAT TGTATTTCCT AATTCTCAGC TTTAACATGT AAGAAAGTAC TTTCGCTAGG 180 

GGTCTTAATT GAATGGTGGG GTCGAGATGA CTGCGTCAGA ATTAAATCTC TGGAAGACCT 240 

CTGAGCTCCT TTTAAAATCA TCAACAAGCG AAAATCCTTA TCAATAGOGA TGTGGGAATG 300 

CATTAGGTAC AGTATTTTAA ACATACAAAA CCTAGGCATA TTAAAAAGCA CTCCTCTGGT 360 

55 AATTTAATAA GGAATAATGA TGTCCTTAAG TTTATTTTAA TCAGCAAGTA TGACTCAATT 420 

TGAAAATATG AGAACAAATA GATTTAAATA GGAACACCCA GTAAACTATG GTATGCAAAT 480 

AAACTCAGAG GTAAACTTGT GAATACATAA ATCTAAATAA GTCAGTTACC ■ ATCAAAATAT 540 

TACGTGATCC TATATTTTTC TGTCCACGTC TTAAAAGTAT TTTTTACCTT GGGGCTTTAT 600 

TTTCTTTCCC TCTTCCCAAG ATTATCCAAG GCAGTTCCAA TACGCGTTTT CAATAATGGT 660 

60 AATTAAACTT TTGGAGGGTA ATCGCCAGCT GTTTCCAGAA AAATACTTTT AATAGGTGGT 720 

AGTCCTCCCT GTAACTGGGT TTTCCCGGGA ATCCGGGGGG GGCATGTTGT TTTTATTGTC 780 

GCCATTTTTG TTTCGTCTTT TTTGTTTTGT TGGTGGTCAG GTTTTCTTTT CATAGCGGGG 840 

GAGAAGATTT TGGTCGGACT AGCGCCATCC AGCGGGTTTA GAGAAGGAGC ACACTTTTCC 900 
CCGAATAGTG GCTTTTGGAT GAACAATGAA 



Seq ID NO: 166 DNA Sequence 

Nucleic Acid Accession #: NM_006953.1 

Coding sequence: 33.-896 



70 1 11 21 31 41 51 

1 I I I 1 ! 

CCGTTCCGCG CTCTGGCGGC TCCTCCCGGG CGATGCCTCC GCTCTGGGCC CTGCTGGCCC 60 

TCGGCTGCCT GCGGTTCGGC TCGGCTGTGA ACCTGCAGCC CCAACTGGCC AGTGTGACTT 120 

TCGCCACCAA CAACCCCACA CTTACCACTG TGGCCTTGGA AAAGCCTCTC TGCATGTTTG 180 

75 ACAGCAAAGA GGCCCTCACT GGCACCCACG AGGTCTACCT GTATGTCCTG GTCGACTCAG 240 

CCATTTCCAG GAATGCCTCA GTGCAAGACA GCACCAACAC CCCACTGGGC TCAACGTTCC 300 

TACAAACAGA GGGTGGGAGG ACAGGTCCCT ACAAAGCTGT GGCCTTTGAC CTGATCCCCT 360 

GCAGTGACCT GCCCAGCCTG GATGCCATTG GGGATGTGTC CAAGGCCTCA CAGATCCTGA 420 
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ATGCCTACCT GGTCAGGGTG GGTGCCAACG GGACCTGCCT GTGGGATCCC AACTTCCAGG 480 

GCCTCTGTAA CGCACCCCTG TCGGCAGCCA CGGAGTACAG GTTCAAGTAT GTCCTGGTCA S40 

ATATGTCCAC GGGCTTGGTA GAGGACCAGA CCCTGTGGTC GGACCCCATC CGCACCAACC 600 

AGCTCACCCC ATACTCGACG ATCGACACGT GGCCAGGCCG GCGGAGCGGA GGCATGATCG 6 SO 

5 TCATCACTTC CATCCTGGGC TCCCTGCCCT TCTTTCTACT TGTGGGTTTT GCTGGCGCCA 720 

TTGCCCTCAG CCTCGTGGAC ATGGGGAGTT CTGATGGGGA AACGACTCAC GACTCCCAAA 780 

TCACTCAGGA GGCTGTTCCC AAGTCGCTGG GGGCCTCGGA GTCTTCCTAC ACGTCCGTGA 840 

ACCGGGGGCC GCCACTGGAC AGGGCTGAGG TGTATTCCAG CAAGCTCCAA GACTGAGCCC 900 

AGCACCACCC CTGGGCAGCA GCATCCTCCT CTCTGGCCTT GCCCCAGGCC CTGCAGCGGT 960 

10 GGTTGTCACA CCCTGACTTC AGGGAAGGTG AAACAGGGCT TGTCCCTCCA ACTGCAGGAA 1020 
AACCCTTAAT AAAATCTTCT GATGAGTTCT AAAAAAAAA 



15 



Seq ID KO: 167 Protein Sequence 
Protein Accession ft-. NP_0Q88B4.1 



1 11 21 31 41 51 

I I 1 I I 1 

MPPLWALLAL GCLRFGSAVN LQPQLASVTP ATNNPTLTTV ALEKPLCMPD SKEALTGTHE 60 
VYLYVLVDSA ISRNASVQDS TNTPLGSTFL QTEGGRTGPY KAVAFDLIPC SDLPSLDAIG 120 
20 DVSKASQ1LN AYLVRVGANG TCLWDPNFQG LCNAPLSAAT EYRPKYVLVN MSTGLVEDQT 180 
LWSDPIRTNQ JUTPYSTIDTW PGRRSGGMIV ITSIIiGSLPF FLLVGFAGAI ALSLVDMGSS 240 
DGETTHDSQI TQEAVPKSLG ASESSYTSVN RGPPLDRAEV YSSKLQD 

25 Seq ID NO: 168 DNA Sequence 

Nucleic Acid Accession »: NM_005672.i 
Coding sequencei 18.. 389 

1 11 21 31 41 51 

30 | | | | I I 

AGGGAGAGGC AGTGACCATG AAGGCTGTGC TGCTTGCCCT GTTGATGGCA GGCTTGGCCC 60 

TGCAGCCAGG CACTGCCCTG CTGTGCTACT CCTGCAAAGC CCAGGTGAGC AACGAGGACT 120 

GCCTGCAGGT GGAGAACTGC ACCCAGCTGG GGGAGCAGTG CTGGACCGCG CGCATCCGCG 180 

CAGTTGGCCT CCTGACCGTC ATCAGCAAAG GCTGCAGCTT GAACTGCGTG GATGACTCAC 240 

35 AGGACTACTA CGTGGGCAAG AAGAACATCA CGTGCTGTGA CACCGACTTG TGCAACGCCA 300 

GCGGGGCCCA TGCCCTGCAG CCGGCTGCCG CCATCCTTGC GCTGCTCCCT GCACTCGGCC 360 

TGCTGCTCTG GGGACCCGGC CAGCTATAGG CTCTGGGGGG CCCCGCTGCA GCCCACACTG 420 

GGTGTGGTGC CCCAGGCCTT TGTGCCACTC CTCACAGAAC CTGGCCCAGT GGGAGCCTGT 480 

CCTGGTTCCT GAGGCACATC CTAACGCAAG TTTGACCATG TATGTTTGCA CCCCTTTTCC 540 

40 CCNAACCCTG ACCTTCCCAT GGGCCTTTTC CAGGATTCCN ACCNGGCAGA TCAGTTTTAG 600 

TGANACANAT CCGCNTGCAG ATGGCCCCTC CAACCNTTTN TGTTGKTGTT TCCATGGCCC 660 

AGCATTTTCC ACCCTTAACC CTGTGTTCAG GCACTTNTTC CCCCAGGAAG CCTTCCCTGC 720 

CCACCCCATT TATGAATTGA GCCAGGTTTG GTCCGTGGTG TCCCCCGCAC CCAGCAGGGG 780 

ACAGGCAATC AGGAGGGCCC AGTAAAGGCT GAGATGAAGT GGACTGAGTA GAACTGGAGG 840 

45 ACAAGAGTTG ACGTGAGTTC CTGGGAGTTT CCAGAGATGG GGCCTGGAGG CCTGGAGGAA 900 

GGGGCCAGGC CTCACATTTG TGGGGNTCCC GAATGGCAGC CTGAGCACAG OGTAGGCCCT 960 
TAATAAACAC CTGTTGGATA AGCCAAAAAA 

Seq ID NO: 169 Protein Sequence 
50 Protein Accession Jh NP_005663.1 

1 11 21 31 41 51 

I I I I I I 

MKAVLLALLM AGLALQPGTA LLCYSCKAQV SNEDCLQVEN CTQLGEQCWT ARIRAVGLLT 60 
55 VISKGCSLNC VDDSQDYYVG KKNITCCDTD LCNASGAHAL QPAAAILALL PALGLLLWGP 120 
GQL 

Seq ID NO: 170 DNA Sequence 
Nucleic Acid Accession Bt NM_006952.l 
60 Coding sequence: 11. .793 

1 11 21 31 41 51 

I I I i I I 

AATCCCGACA ATGGCGAAAG ACAACTCAAC TGTTCGTTGC TTCCAGGGCC TGCT GATT TT 60 

65 TGGAAATGTG ATTATTGGTT GTTGCGGCAT TGCCCTGACT GCGGAGTGCA TCTTCTTTGT 120 

ATCTGACCAA CACAGCCTCT ACCCACTGCT TGAAGCCACC GACAACGATG ACATCTATGG 180 

GGCTGCCTGG ATCGGCATAT TTGTGGGCAT CTGCCTCTTC TGCCTGTCTG TTCTAGGCAT 240 

TGTAGGCATC ATGAAGTCCA GCAGGAAAAT TCTTCTGGCG TATTTCATTC TGATGTTTAT 300 

AGTATATGCC TTTGAAGTGG CATCTTGTAT CACAGCAGCA ACACAACGAG ACTTTTTCAC 360 

70 ACCCAACCTC TTCCTGAAGC AGATGCTAGA GAGGTACCAA AACAACAGCC CTCCAAACAA 420 

TGATGACCAG TGGAAAAACA ATGGAGTCAC CAAAACCTGG GACAGGCTCA TGCTCCAGGA 480 

CAATTGCTGT GGCGTAAATG GTCCATCAGA CTGGCAAAAA TACACATCTG CCTTCCGGAC 540 

TGAGAATAAT GATGCTGACT ATCCCTGGCC TCGTCAATGC TGTGTTATGA ACAATCTTAA 600 

AGAACCTCTC AACCTGGAGG CTTGTAAACT AGGCGTGCCT GGTTTTTATC ACAATCAGGG 660 

75 CTGCTATGAA CTGATCTCTG GTCCAATGAA CCGACACGCC TGGGGGGTTG CCTGGTTTGG 720 

ATTTGCCATT CTCTGCTGGA CTTTTTGGGT TCTCCTGGGT ACCATGTTCT ACTGGAGCAG 780 
AATTGAATAT TAAGAA 
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Seq id NO: 171 Protein Sequence 
Protein Accession #: NPJ308883.1 

1 11 21 31 41 51 

111)11 

MAKDNSTVRC FQGLL.IFGNV IIGCCGIALT AECIFFVSDQ HSLYPLLEAT DNDDIYGAAH 60 

IGIFVGICLF CLSVLGIVGI MKSSRKILLA YFILMFIVYA FEVASCITAA TQRDFFTPNL 120 

FLKQMLERYQ NNSPPNNDDQ WKNNGVTKTW DRLMLQDNCC GVNGPSDWQK YTSAFRTENN 180 

DADYPWPRQC CVMNNLKEPL NLEACKLGVP GFYHNQGCYE LISGPMNRHA WGVAWFGFAI 240 
LCWTFWVLLG TMFYWSRIEY 

Seq ID NO: 172 DNA Sequence 

Nucleic Acid Accession #: Eos sequence 

Coding sequence: 1 . . 672 



1 11 21 31 41 51 

111111 

ATGAGGCTCC AAAGACCCCG ACAGGCCCCG GCGGGTGGGA GGCGCGCGCC CCGGGGCGGG 60 

CGGGGCTCCC CCTACCGGCC AGACCCGGGG AGAGGCGCGC GGAGGCTGCG AAGGTTCCAG 120 

20 AAGGGCGGGG AGGGGGCGCC GCGCGCTGAC CCTCCCTGGG CACCGCTGGG GACGATGGCG 180 

CTGCTCGCCT TGCTGCTGGT CGTGGCCCTA CCGCGGGTGT GGACAGACGC CAACCTGACT 240 

GCGAGACAAC GAGATCCAGA GGACTCCCAG CGAACGGACG AGGGTGACAA TAGAGTGTGG 300 

TGTCATGTTT GTGAGAGAGA AAACACTTTC GAGTGCCAGA ACCCAAGGAG GTGCAAATGG 360 

ACAGAGCCAT ACTGCGTTAT AGCGGCCGTG AAAATATTTC CACGTTTTTT CATGGTTGCG 420 

25 AAGCAGTGCT CCGCTGGTTG TGCAGCGATG GAGAGACCCA AGCCAGAGGA GAAGCGGTTT 480 

CTCCTGGAAG AGCCCATGCC CTTCTTTTAC CTCAAGTGTT GTAAAATTCG CTACTGCAAT 540 

TTAGAGGGGC CACCTATCAA CTCATCAGTG TTCAAAGAAT ATGCTGGGAG CATGGGTGAG 600 

AGCTGTGGTG GGCTGTGGCT GGCCATCCTC CTGCTGCTGG CCTCCATTGC AGCCGGCCTC 660 
AGCCTGTCTT GA 



Seq ID NO: 173 Protein Sequence 
Protein Accession 8: Eos sequence 



1 11 21 31 41 51 

35 | | | | | j 

MRLQRPRQAP AGGRRAPRGG RGSPYRPDPG RGARRLRRFQ KGGEGAPRAD PPWAPLGTMA 60 

LLALLLWAL PRVWTDANLT ARQRDPEDSQ RTDEGDNRVW CHVCERENTF ECQNPRRCKW 120 

TEPYCVIAAV KIFPRFFMVA KQCSAGCAAM ERPKPEBKRF LLEEPMPFFY LKCCKI RYCN 180 
LEGPPINSSV FKEYAGSMGE SCGGLWLAIL LLLASIAAGL SLS 



Seq ID NO: 174 DNA Sequence 
Nucleic Acid Accession 8: XM_057014 
Coding sequence: 143.. 874 



1 11 21 31 41 51 

I I I I I 1 

GGGA6GGAGA GAGGCGCGCG GGTGAAAGGC GCATTGATGC AGCCTGCGGC GGCCTCGGAG 60 

CGCGGCGGAG CCAGACGCTG ACCACGTTCC TCTCCTCGGT CTCCTCCGCC TCCAGCTCCG 120 

CGCTGCCCGG CAGCCGGGAG CCATGCGACC CCAGGGCCCC GCCGCCTCCC CGCAGCGGCT 180 

CCGCGGCCTC CTGCTGCTCC TGCTGCTGCA GCTGCCCGCG CCGTCGAGCG CCTCTGAGAT 240 

CCCCAAGGGG AAGCAAAAGG CGCAGCTCCO GCAGAGGGAG GTGGTGGACC TGTATAATGG 300 

AATGTGCTTA CAAGGGCCAG CAGGAGTGCC TGGTCGAGAC GGGAGCCCTG GGGCCAATGG 360 

CATTCCGGGT ACACCTGGGA TCCCAGGTCG GGATGGATTC AAAGGAGAAA AGGGGGAATG 420 

55 TCTGAGGGAA AGCTTTGAGG AGTCCTGGAC ACCCAACTAC AAGCAGTGTT CATGGAGTTC 480 

ATTGAATTAT GGCATAGATC TTGGGAAAAT TGCGGAGTGT ACATTTACAA AGATGCGTTC 540 

AAATAGTGCT CTAAGAGTTT TGTTCAGTGG CTCACTTCGG CTAAAATGCA GAAATGCATG 600 

CTGTCAGCGT TGGTATTTCA CATTCAATGG AGCTGAATGT TCAGGACCTC TTCCCATTGA 660 

AGCTATAATT TATTTGGACC AAGGAAGCCC TGAAATGAAT TCAACAATTA ATATTCATCG 720 

OU CACTTCTTCT GTGGAAGGAC TTTGTGAAGG AATTGGTGCT GGATTAGTGG ATGTTGCTAT 780 

CTGGGTTGGC ACTTGTTCAG ATTACCCAAA AGGAGATGCT TCTACTGGAT GGAATTCAGT 840 

TTCTCGCATC ATTATTGAAG AACTACCAAA ATAAATGCTT TAATTTTCAT TTGCTACCTC 900 

TTTTTTTATT ATGCCTTGGA ATGGTTCACT TAAATGACAT TTTAAATAAG TTTATGTATA 960 

CATCTGAATG AAAAGCAAAG CTAAATATGT TTACAGACCA AAGTGTGATT TCACACTGTT 1020 

TTTAAATCTA GCATTATTCA TTTTGCTTCA ATCAAAAGTG GTTTCAATAT TTTTTTTAGT 1080 

TGGTTAGAAT ACTTTCTTCA TAGTCACATT CTCTCAACCT ATAATTTGGA ATATTGTTGT 1140 

GGTCTTTTGT TTTTTCTCTT AGTATAGCAT TTTTAAAAAA ATATAAAAGC TACCAATCTT 1200 

TGTACAATTT GTAAATGTTA AGAATTTTTT TTATATCTGT TAAATAAAAA TTATTTCCAA 1260 
CAACCTTAAA AAAAAAAAAA AAAA 



Seq ID NO: 175 Protein Sequence 
Protein Accession 8: xp_057014 



l 11 21 31 41 51 

75 | | | j 1 j 

MRPQGPAASP QRLRGLlLLLI, llqlpapssa seipkgkqka qlrqrewdl YNGMCLQGPA 60 
GVPGRDGSPG ANGIPGTPGI PGRDGFKGEK GECLRESFEE SWTPNYKQCS WSSLNYGIDL 120 
GKIAECTFTK MRSNSALRVL FSGSLRLKCR NACCQRWYFT PNGAECSGPL PIEAIIYLDQ 180 
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GSPEMNSTIN IHRTSSVEGL C EG I GAG LVD VAIWVGTCSD YPKGDASTGW NSVSRIIIEB 240 ' 
LPK I 

Seq ID NO: 176 DNA Sequence 
5 Nucleic Acid Accession 8: XM_084007 
Coding sequence: 1..2268 

1 11 21 31 41 51 

in 1 1 1 1 1 1 

1U ATGGCGAGGA AGTTATCTGT AATCTTGATC CTGACCTTTG CCCTCTCTGT CACAAATCCC 60 

CTTCATGAAC TAAAAGCAGC TGCTTTCCCC CAGACCACTG AGAAAATTAG TCCGAATTGG 120 

GAATCTGGCA TTAATGTTGA CTTCGCAATT TCCACACGGC AATATCATCT ACAACAGCTT 180 

TTCTACCGCT ATGGAGAAAA TAATTCTTTG TCAGTTGAAG GGTTCAGAAA ATTACTTCAA 240 

AATATAGGCA TAGATAAGAT TAAAAGAATC CATATACACC ATGACCACGA CCATCACTCA 300 

15 GACCACGAGC ATCACTCAGA CCATGAGCGT CACTCAGACC ATGAGCATCA CTCAGACCAC 360 

GAGCATCACT CTGACCATGA TCATCACTCC CACCATAATC ATGCTGCTTC TGGTAAAAAT 420 

AAGCGAAAAG CTCTTTGCCC AGACCATGAC TCAGATAGTT CAGGTAAAGA TCCTAGAAAC 480 

AGCCAGGGGA AAGGAGCTCA CCGACCAGAA CATGCCAGTG GTAGAAGGAA TGTCAAGGAC 540 

AGTGTTAGTG CTAGTGAAGT GACCTCAACT GTGTACAACA CTGTCTCTGA AGGAACTCAC 600 

20 TTTCTAGAGA CAATAGAGAC TCCAAGACCT GGAAAACTCT TCCCCAAAGA TGTAAGCAGC 660 

TCCACTCCAC CCAGTGTCAC ATCAAAGAGC CGGGTGAGCC GGCTGGCTGG TAGGAAAACA 720 

AATGAATCTG TGAGTGAGCC CCGAAAAGGC TTTATGTATT CCAGAAACAC AAATGAAAAT 780 

CCTCAGGAGT GTTTCAATGC ATCAAAGCTA CTGACATCTC ATGGCATGGG CATCCAGGTT 840 

CCGCTGAATG CAACAGAGTT CAACTATCTC TGTCCAGCCA TCATCAACCA AATTGATGCT 900 

25 AGATCTTGTC TGATTCATAC AAGTGAAAAG AAGGCTGAAA TCCCTCCAAA GACCTATTCA 960 

TTACAAATAG CCTGGGTTGG TGGTTTTATA GCCATTTCCA TCATCAGTTT CCTGTCTCTG 1020 

CTGGGGGTTA TCTTAGTGCC TCTCATGAAT CGGGTGTTTT TCAAATTTCT CCTGAGTTTC 1080 

CTTGTGGCAC TGGCCGTTGG GACTTTGAGT GGTGATGCTT TTTTACACCT TCTTCCACAT 1140 

TCTCATGCAA GTCACCACCA TAGTCATAGC CATGAAGAAC CAGCAATGGA AATGAAAAGA 1200 

30 GGACCACTTT TCAGTCATCT GTCTTCTCAA AACATAGAAG AAAGTGCCTA TTTTGATTCC 1260 

ACGTGGAAGG GTCTAACAGC TCTAGGAGGC CTGTATTTCA TGTTTCTTGT TGAACATGTC 1320 

CTCACATTGA TCAAACAATT TAAAGATAAG AAGAAAAAGA ATCAGAAGAA ACCTGAAAAT 1380 

GATGATGATG TGGAGATTAA GAAGCAGTTG TCCAAGTATG AATCTCAACT TTCAACAAAT 1440 

GAGGAGAAAG TAGATACAGA TGATCGAACT GAAGGCTATT TACGAGCAGA CTCACAAGAG 1500 

35 CCCTCCCACT TTGATTCTCA GCAGCCTGCA GTCTTGGAAG AAGAAGAGGT CATGATAGCT 1560 

CATGCTCATC CACAGGAAGT CTACAATGAA TATGTACCCA GAGGGTGCAA GAATAAATGC ,1620 

CATTCACATT TCCACGATAC ACTCGGCCAG TCAGACGATC TCATTCACCA CCATCATGAC 1680 

TACCATCATA TTCTCCATCA TCACCACCAC CAAAACCACC ATCCTCACAG TCACAGCCAG 1740 

CGCTACTCTC GGGAGGAGCT GAAAGATGCC GGCGTCGCCA CTTTGGCCTG GATGGTGATA 1800 

40 ATGGGTGATG GCCTGCACAA TTTCAGCGAT GGCCTAGCAA TTGGTGCTGC TTTTACTGAA 1860 

GGCTTATCAA GTGGTTTAAG TACTTCTGTT GCTGTGTTCT GTCATGAGTT GCCTCATGAA 1920 

TTAGGTGACT TTGCTGTTCT ACTAAAGGCT GGCATGACCG TTAAGCAGGC TGTCCTTTAT 1980 

AATGCATTGT CAGCCATGCT GGCGTATCTT GGAATGGCAA CAGGAATTTT CATTGGTCAT 2040 

TATGCTGAAA ATGTTTCTAT GTGGATATTT GCACTTACTG CTGGCTTATT CATGTATGTT 2100 

45 GCTCTGGTTG ATATGGTACC TGAAATGCTG CACAATGATG CTAGTGACCA TGGATGTAGC 2160 

CGCTGGGGGT ATTTCTTTTT ACAGAATGCT GGGATGCTTT TGGGTTTTGG AATTATGTTA 2220 
CTTATTTCCA TATTTGAACA TAAAATCGTG TTTCGTATAA ATTTCTAG 

Seq ID NO: 177 Protein Sequence 
50 Protein Accession 8: XP_084007 

1 11 21 31 41 51 

I I I I I I 

MARKLSVILI LTFALSVTNP LHELKAAAPP QTTEKISPNW ESGINVDLAI STRQYHLQQL 60 

55 FYRYGBNNSL SVEGFRKLLQ NIGIDKIKRI HIHHDHDHHS DHEHHSDHER HSDHEHHSDH 120 

EHHSDHDHHS HHNHAASGKN KRKALCPDHD SDSSGKDPRN SQGKGAHRPE HASGRRNVKD 180 

SVSASEVTST VYNTVSEGTH FLETIETPRP GKLFPKDVSS STPPSVTSKS RVSRLAGRKT 240 

NESVSEPRKG FMYSRNTNEN PQECFNASKL LTSHGMGIQV PLNATEPNYL CPAIINQIDA 300 

RSCLIHTSEK KAEIPPKTYS LQIAWVGGPI AISIISFLSIi LGVILVPLMN RVFFKFLIiSF 360 

60 LVALAVGTLS GDAFLHLLPH SHASHHHSHS HEEPAMEMKR GPLFSHLSSQ NIEESAYFDS 420 

TWKGLTALGG LYFMFLVEHV LTLIKQPKDK KKKNQKKPEN DDDVEIKKQL SKYESQLSTN 480 

EEKVDTDDRT EGYLRADSQE PSHFDSQQPA VLEEEEVMIA HAHPQEVYNE YVPRGCKNKC 540 

HSHFHDTLGQ SDDLIHHHHD YHHILHHHHH QNHHPHSHSQ RYSREELKDA GVATLAWMVI 600 

MGDGLHNFSD GLAIGAAFTE GLSSGLSTSV AVFCHELPHE LGDFAVLLKA GMTVKQAVLY 660 

65 NALSAMLAYL GMATGIFIGH YAENVSMW1P ALTAGLFMYV ALVDMVPEML HNDASDHGCS 720 
RWGYPFLQNA GMLLGFGIML LISIFEHKIV FRINF 

Seq ID NO: 178 DNA Sequence 
Nucleic Acid Accession §: BC010423 
70 Coding sequence t 248.. 1780 

1 11 21 31 41 51 

I I I 1 I 1 

CACAGCGTGG GAAGCAGCTC TGGGGGAGCT OGGAGCTCCC GATCAOGGCT TCTTGGGGGT 60 

75 AGCTACGGCT GGGTGTGTAG AACGGGGCCG GGGCTGGGGC TGGGTCCCCT AGTGGAGACC 120 

CAAGTGCGAG AGGCAAGAAC TCTGCAGCTT CCTGCCTTCT GGGTCAGTTC CTTATTCAAG 180 

TCTGCAGCCG GCTCCCAGGG AGATCTCGGT GGAACTTCAG AAACGCTGGG CAGTCTGCCT 240 

TTCAACCATG CCCCTGTCCC TGGGAGCCGA GATGTGGGGG CCTGAGGCCT GGCTGCTGCT 300 
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GCTGCTACTG CTGGCATCAT TTACAGGCCG GTGCCCCGCG GGTGAGCTGG AGACCTCAGA 360 

CGTGGTAACT GTGGTGCTGG GCCAGGACGC AAAACTGCCC TGCTTCTACC GAGGGGACTC 420 

CGGCGAGCAA GTGGGGCAAG TGGCATGGGC TCGGGTGGAC GCGGGCGAAG GCGCCCAGGA 480 

ACTAGCGCTA CTGCACTCCA AATACGGGCT TCATGTGAGC CCGGCTTACG AGGGCCGCGT 540 

5 GGAGCAGCCG CCGCCCCCAC GCAACCCCCT GGACGGCTCA GTGCTCCTGC GCAACGCAGT 600 

GCAGGCGGAT GAGGGCGAGT ACGAGTGCCG GGTCAGCACC TTCCCCGCCG GCAGCTTCCA 660 

GGCGCGGCTG CGGCTCCGAG TGCTGGTGCC TCCCCTGCCC TCACTGAATC CTGGTCCAGC 720 

ACTAGAAGAG GGCCAGGGCC TGACCCTGGC AGCCTCCTGC ACAGCTGAGG GCAGCCCAGC 780 

CCCCAGCGTG ACCTGGGACA CGGAGGTCAA AGGCACAACG TCCAGCCGTT CCTTCAAGCA 840 

11) CTCCCGCTCT GCTGCCGTCA CCTCAGAGTT CCACTTGGTG CCTAGCCGCA GCATGAATGG 900 

GCAGCCACTG ACTTGTGTGG TGTCCCATCC TGGCCTGCTC CAGGACCAAA GGATCACCCA 960 

CATCCTCCAC GTGTCCTTCC TTGCTGAGGC CTCTGTGAGG GGCCTTGAAG ACCAAAATCT 1020 

GTGGCACATT GGCAGAGAAG GAGCTATGCT CAAGTGCCTG AGTGAAGGGC AGCCCCCTCC 1080 

CTCATACAAC TGGACACGGC TGGATGGGCC TCTGCCCAGT GGGGTACGAG TGGATGGGGA 1140 

1 J CACTTTGGGC TTTCCCCCAC TGACCACTGA GCACAGCGGC ATCTACGTCT GCCATGTCAG 1200 

CAATGAGTTC TCCTCAAGGG ATTCTCAGGT CACTGTGGAT GTTCTTGACC CCCAGGAAGA 1260 

CTCTGGGAAG CAGGTGGACC TAGTGTCAGC CTCGGTGGTG GTGGTGGGTG TGATCGCCGC 1320 

ACTCTTGTTC TGCCTTCTGG TGGTGGTGGT GGTGCTCATG TCCCGATACC ATCGGCGCAA 1380 

GGCCCAGCAG ATGACCCAGA AATATGAGGA GGAGCTGACC CTGACCAGGG AGAACTCCAT 1440 

20 COGGAGGCTG CATTCCCATC ACACGGACCC CAGGAGCCAG CCGGAGGAGA GTGTAGGGCT 1500 

GAGAGCCGAG GGCCACCCTG ATAGTCTCAA GGACAACAGT AGCTGCTCTG TGATGAGTGA 1560 

AGAGCCCGAG GGCCGCAGTT ACTCCACGCT GACCACGGTG AGGGAGATAG AAACACAGAC 1620 

TGAACTGCTG TCTCCAGGCT CTGGGCGGGC CGAGGAGGAG GAAGATCAGG ATGAAGGCAT 1680 

CAAACAGGCC ATGAACCATT TTGTTCAGGA GAATGGGACC CTACGGGCCA AGCCCACGGG 1740 

25 CAATGGCATC TACATCAATG GGOGGGGACA CCTGGTCTGA CCCAGGCCTG CCTCCCTTCC 1800 

CTAGGCCTGG CTCCTTCTGT TGACATGGGA GATTTTAGCT CATCTTGGGG GCCTCCTTAA 1860 

ACACCCCCAT TTCTTGCGGA AGATGCTCCC CATCCCACTG ACTGCTTGAC CTTTACCTCC 1920 

AACCCTTCTG TTCATCGGGA GGGCTCCACC AATTGAGTCT CTCCCACCAT GCATGCAGGT 1980 

CACTGTGTGT GTGCATGTGT GCCTGTGTGA GTGTTGACTG ACTGTGTGTG TGTGGAGGGG 2040 

30 TGACTGTCCG TGGAGGGGTG ACTGTGTCCG TGGTGTGTAT TATGCTGTCA TATCAGAGTC 2100 

AAGTGAACTG TGGTGTATGT GCCACGGGAT TTGAGTGGTT GCGTGGGCAA CACTGTCAGG 2160 

GTTTGGCGTG TGTGTCATGT GGCTGTGTGT GACCTCTGCC TGAAAAAGCA GGTATTTTCT 2220 

CAGACCCCAG AGCAGTATTA ATGATGCAGA GGTTGGAGGA GAGAGGTGGA GACTGTGGCT 22 BO 

CAGACCCAGG TGTGCGGGCA TAGCTGGAGC TGGAATCTGC CTCCGGTGTG AGGGAACCTG 2340 

35 TCTCCTACCA CTTCGGAGCC ATGGGGGCAA GTGTGAAGCA GCCAGTCCCT GGGTCAGCCA 2400 

GAGGCTTGAA CTGTTACAGA AGCCCTCTGC CCTCTGGTGG CCTCTGGGCC TGCTGCATGT 2460 

ACATATTTTC TGTAAATATA CATGCGCCGG GAGCTTCTTG CAGGAATACT GCTCCGAATC 2520 

ACTTTTAATT TTTTTCTTTT TTTTTTCTTG CCCTTTCCAT TAGTTGTATT TTTTATTTAT 2580 

TTTTATTTTT ATTTTTTTTT AGAGTTTGAG TCCAGCCTGG ACGATATAGC CAGACCCTGT 2640 

40 CTGTAAAAAA ACCAAAACCC AAAAAAAAAA AAAAAAAAAA 



45 



60 



Seq ID NO: 179 Protein Sequence 
Protein Accession #: AAH10423 



11 21 31 41 51 

I ! I I I I 

MPLSLGAEMW GPEAWLLLLL LLASFTGRCP AGELETSDW TWLGQDAKL PCFYRGDSGE 60 

QVGQVAWARV DAGEGAQELA LLHSKYGLHV SPAYEGRVEQ PPPPRNPLDG SVLLRNAVQA 120 

DEGEYECRVS TFPAGSPQAR LRLRVLVPPL PSLNPGPALE EGQGLTLAAS CTAEGSPAPS 180 

50 VTWDTEVKGT TSSRSFKHSR SAAVTSEPHI* VPSRSMNGQP LTCWSHPGL LQDQRITHIL 240 

HVSFLAEASV RGLEDQNLWH IGREGAMLKC LSEGQPPPSY NWTRLDGPLP SGVRVDGDTL 300 

GPPPLTTEHS GIYVCHVSNE FSSRDSQVTV DVLDPQEDSG KQVDLVSASV VWGVIAALL 360 

FCLLVWWL MSRYHRRKAQ QMTQKYEEEI* TLTRENSIRR LHSHHTDPRS QPEESVGLRA 420 

EGHPDSLKDN SSCSVMSEEP EGRSYSTLTT VREIETOTEL LSPGSGRAEE EEDQDEGIKQ 480 

55 AMNHFVQENG TLRAKPTGNG IYINGRGHLV 



Seq ID NO; 180 DNA Sequence 

Nucleic Acid Accession 8: Eos sequence 

Coding sequence: 482.. 3007 



1 11 21 31 41 51 

I I I I I I 

AACTGAGCTA ACAAGAAATA CTAGAAAAGG AGGAAGGAGA ACATTGCTGC AGCTTGGATC 60 

cc TACAACCTAA GAAAGCAAGA GTGATCAATC TCAGCTCTGT TAAACATCTT GTTTACTTAC 120 

65 TGCATTCAGC AGCTTGCAAA TGGTTAACTA TATGCAAAAA AGTCAGCATA GCTGTGAAGT 180 

ATGCCGTGAA TTTTAATTGA GGGAAAAAGG GACAATTGCT TCAGGATGCT CTAGTATGCA 240 

CTCTGCTTGA AATATTTTCA ATGAAATGCT CAGTATTCTA TCTTTGACCA GAGGTTTTAA 300 

CTTTATGAAG CTATGGGACT TGACAAAAAG TGATATTTGA GAAGAAAGTA CGCAGTGGTT 360 

A GGTGTTTTCT TTTTTTTAAT AAAGGAATTG AATTACTTTG AACACCTCTT CCAGCTGTGC 420 

70 ATTACAGATA ACGTCAGGAA GAGTCTCTGC TTTACAGAAT CGGATTTCAT CACATGACAA 480 

CATGAAGCTG TGGATTCATC TCTTTTATTC ATCTCTCCTT GCCTGTATAT CTTTACACTC 540 

CCAAACTCCA GTGCTCTCAT CCAGAGGCTC TTGTGATTCT CTTTGCAATT GTGAGGAAAA 600 

AGATGGCACA ATGCTAATAA ATTGTGAAGC AAAAGGTATC AAGATGGTAT CTGAAATAAG 660 

_ c TGTGCCACCA TCACGACCTT TCCAACTAAG CTTATTAAAT AACGGCTTGA CGATGCTTCA 720 

75 CACAAATGAC TTTTCTGGGC TTACCAATGC TATTTCAATA CACCTTGGAT TTAACAATAT 780 

TGCAGATATT GAGATAGGTG CATTTAATGG CCTTGGCCTC CTGAAACAAC TTCATATCAA 840 

TCACAATTCT TTAGAAATTC TTAAAGAGGA TACTTTCCAT GGACTGGAAA ACCTGGAATT 900 

CCTGCAAGCA GATAACAATT TTATCACAGT GATTGAACCA AGTGCCTTTA GCAAGCTCAA 960 
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CAGACTCAAA GTGTTAATTT TAAATGACAA TGCTATTGAG AGTCTTCCTC CAAACATCTT 1020 

CCGATTTGTT CCTTTAACCC ATCTAGATCT TCGTGGAAAT CAATTACAAA CATTGCCTTA 1080 

TGTTGGTTTT CTCGAACACA TTGGCCGAAT ATTGGATCTT CAGTTGGAGG ACAACAAATG 1140 

GGCCTGCAAT TGTGACTTAT TGCAGTTAAA AACTTGGTTG GAGAACATGC CTCCACAGTC 1200 

5 TATAATTGGT GATGTTGTCT GCAACAGCCC TCCATTTTTT AAAGGAAGTA TACTCAGTAG 1260 

ACTAAAGAAG GAATCTATTT GCCCTACTCC ACCAGTGTAT GAAGAACATG AGGATCCTTC 1320 

AGGATCATTA CATCTGGCAG CAACATCTTC AATAAATGAT AGTCGCATGT CAACTAAGAC 1380 

CACGTCCATT CTAAAACTAC CCACCAAAGC ACCAGGTTTG ATACCTTATA TTACAAAGCC 1440 

ATCCACTCAA CTTCCAGGAC CTTACTGCCC TATTCCTTGT AACTGCAAAG TCCTATCCCC 1500 

10 ATCAGGACTT CTAATACATT GTCAGGAGCG CAACATTGAA AGCTTATCAG ATCTGAGACC 15 SO 

TCCTCCGCAA AATCCTAGAA AGCTCATTCT AGCGGGAAAT ATTATTCACA GTTTAATGAA 1620 

GTCTGATCTA GTGGAATATT TCACTTTGGA AATGCTTCAC TTGGGAAACA ATCGTATTGA 1680 

AGTTCTTGAA GAAGGATCGT TTATGAACCT AACGAGATTA CAAAAACTCT ATCTAAATGG 1740 

TAACCACCTG ACCAAATTAA GTAAAGGCAT GTTCCTTGGT CTCCATAATC TTGAATACTT 1800 

15 ATATCTTGAA TACAATGCCA TTAAGGAAAT ACTGCCAGGA ACCTTTAATC CAATGCCTAA 1860 

ACTTAAAGTC CTGTATTTAA ATAACAACCT CCTCCAAGTT TTACCACCAC ATATTTTTTC 1920 

AGGGGTTCCT CTAACTAAGG TAAATCTTAA AACAAACCAG TTTACCCATC TACCTGTAAG 1980 

TAATATTTTG GATGATCTTG ATTTACTAAC CCAGATTGAC CTTGAGGATA ACCCCTGGGA 2040 

CTGCTCCTGT GACCTGGTTG GACTGCAGCA ATGGATACAA AAGTTAAGCA AGAACACAGT 2100 

20 GACAGATGAC ATCCTCTGCA CTTCCCCCGG GCATCTCGAC AAAAAGGAAT TGAAAGCCCT 2160 

AAATAGTGAA ATTCTCTGTC CAGGTTTAGT AAATAACCCA TCCATGCCAA CACAGACTAG 2220 

TTACCTTATG GTCACCACTC CTGCAACAAC AACAAATACG GCTGATACTA TTTTACGATC 2280 

TCTTACGGAC GCTGTGCCAC TGTCTGTTCT AATATTGGGA CTTCTGATTA TGTTCATCAC 2340 

TATTGTTTTC TGTGCTGCAG GGATAGTGGT TCTTGTTCTT CACCGCAGGA GAAGATACAA 2400 

25 AAAGAAACAA GTAGATGAGC AAATGAGAGA CAACAGTCCT GTGCATCTTC AGTACAGCAT 2460 

GTATGGCCAT AAAACCACTC ATCACACTAC TGAAAGACCC TCTGCCTCAC TCTATGAACA 2520 

GCACATGGTG AGCCCCATGG TTCATGTCTA TAGAAGTCCA TCCTTTGGTC CAAAGCATCT 2580 

GGAAGAGGAA GAAGAGAGGA ATGAGAAAGA AGGAAGTGAT GCAAAACATC TCCAAAGAAG 2640 

TCTTTTGGAA CAGGAAAATC ATTCACCACT CACAGGGTCA AATATGAAAT ACAAAACCAC 2700 

30 GAACCAATCA ACAGAATTTT TATCCTTCCA AGATGCCAGC TCATTGTACA GAAACATTTT 2760 

AGAAAAAGAA AGGGAACTTC AGCAACTGGG AATCACAGAA TACCTAAGGA AAAACATTGC 2820 

TCAGCTCCAG CCTGATATGG AGGCACATTA TCCTGGAGCC CACGAAGAGC TGAAGTTAAT 2 880 

GGAAACATTA ATGTACTCAC GTCCAAGGAA GGTATTAGTG GAACAGACAA AAAATGAGTA 2 940 

TTTTGAACTT AAAGCTAATT TACATGCTGA ACCTGACTAT TTAGAAGTCC TGGAGCAGCA 3000 

35 AACATAGATG GAGAGTTTGA GGGCTTTCGC AGAAATGCTG TGATTCTGTT TTAAGTCCAT 3060 

ACCTTGTAAA TAAGTGCCTT ACGTGAGTGT GTCATCAATC AGAACCTAAG CACAGCAGTA 3120 

AACTATGGGG AAAAAAAAAG AAGAAGAAAA GAAACTCAGG GATCACTGGG AGAAGCCATG 3180 

GCATTATCTT CAGGCAATTT AGTCTGTCCC AAATAAAATC AATCCTTGCA TGTAAATC 3238 

40 Seq ID NO: 1B1 Protein Sequence 

Protein Accession #s Eos sequence 

1 11 21 31 41 51 

AC I I I I I I 

45 MKLWIHLFYS SLLACISLHS QTPVLSSRGS CDSLCNCEEK DGTMLINCEA KGIKMVSEIS 60 

VPPSRPPQLS LLNNGLTMLH TNDFSGLTNA ISIHLGFNNI ADIEIGAFNG LGLLKQLHIN 120 

HNSLEILKED TFHGLENLEF LQADNNFITV IEPSAFSKLN RLKVLILNDN AXESLPPNIF 180 

RFVPLTHLDL RGNQLQTLPY VGFLEHIGRI LDLQLEDNKW ACNCDLLQLK TWLENMPPQS 240 

IIGDWCNSP PFFKGSILSR LKKESICPTP PVYEEHEDPS GSLHLAATSS INDSRMSTKT 300 

50 TSILKLPTKA PGLIPYITKP STQLPGPYCP IPCNCKVLSP SGLLIHOQER NIESLSDLRP 360 

PPQNPRKLIL AGNIIHSLMK SDLVEYPTLE MLHLGNNRIE VLEEGSFMNL TRLQKLYLNG 420 

NHLTKLSKGM FLGLHNLEYL YLEYNAIKEZ LPGTFNPMPK LKVLYLNNNL LQVLPPHIFS 480 

GVPLTKVNLK TNQFTHLPVS NILDDLDLLT QIDLEDNPWD CSCDLVGLQQ WIQKLSKNTV 540 

TDDILCTSPG HLDKKELKAL MSEILCPGLV NNPSMPTQTS YLMVTTPATT TNTADTILRS 600 

55 LTDAVPLSVL ILGLLIMFIT IVFCAAGIW LVLHRRRRYK KKQVDEQMRD NSPVHLQYSM 660 

YGHKTTHHTT ERPSASLYEQ HMVSPMVHVY RSPSFGPKHL EEEEERNEKE GSDAKHLQRS 720 

LLEQENHSPL TGSNMKYKTT NQSTEFLSFQ DASSLYRNIL EKERELQQLG ITEYLRKNIA 780 

QLQPDMEAHY PGAHEELKLM ETLMYSRPRK VLVEQTKNEY FELKANLHAE PDYLEVLEQQ 840 
T 



60 



65 



Seq ID NO: 182 DNA Sequence 

Nucleic Acid Accession #: XW_035292.2 

Coding sequence: 1..1524 



1 11 21 31 41 51 

I I I I 1 I 

ATGGCGGGTG CGGGCCCGAA GCGGCGCGCG CTAGCGGCGC CGGCGGCCGA GGAGAAGGAA 60 

A GAGGCGCGGG AGAAGATGCT GGCCGCCAAG AGCGCGGACG GCTCGGCGCC GGCAGGCGAG 120 

70 GGCGAGGGCG TGACCCTGCA GCGGAACATC ACGCTGCTCA ACGGCGTGGC CATCATCGTG 180 

GGGACCATTA TCGGCTCGGG CATCTTCGTG ACGCCCACGG GCGTGCTCAA GGAGGCAGGC 240 

TCGCCGGGGC TGGCGCTGGT GGTGTGGGCC GCGTGCGGCG TCTTCTCCAT CGTGGGCGGG 300 

CTCTGCTACG CGGAGCTCGG CACCACCATC TCCAAATCGG GCGGCGACTA CGCCTACATG 360 

_ _ CTGGAGGTCT ACGGCTCGCT GCCCGCCTTC CTCAAGCTCT GGATCGAGCT GCTCATCATC 420 

75 CGGCCTTCAT CGCAGTACAT C3GTGGCCCTG GTCTTCGCCA CCTACCTGCT CAAGCCGCTC 480 

TTCCCCACCT GCCCGGTGCC CGAGGAGGCA GCCAAGCTCG TGGCCTGCCT CTGCGTGCTG 540 

CTGCTCACGG CCGTGAACTG CTACAGCGTG AAGGCCGCCA CCCGGGTCCA GGATGCCTTT 600 

GCCGCCGCCA AGCTCCTGGC CCTGGCCCTG ATCATCCTGC TGGGCTTCGT CCAGATCGGA 660 
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AAGGGTGATG TGTCCAATCT AGATCCCAAC TTCTCATTTG AAGGCACCAA ACTGGATGTG 720 

GGGAACATTG TGCTGGCATT ATACAGCGGC CTCTTTGCCT ATGGAGGATG GAATTACTTG 780 

AATTTCGTCA CAGAGGAAAT GATCAACCCC TACAGAAACC TGCCCCTGGC CATCATCATC 840 

TCCCTGCCCA TCGTGACGCT GGTGTACGTG CTGACCAACC TGGCCTACTT CACCACCCTG 900 

5 TCCACCGAGC AGATGCTGTC GTCCGAGGCC GTGGCCGTGG ACTTCGGGAA CTATCACCTG 960 

GGCGTCATGT CCTGGATCAT CCCCGTCTTC GTGGGCCTGT CCTGCTTCGG CTCCGTCAAT 1020 

GGGTCCCTGT TCACATCCTC CAGGCTCTTC TTCGTGGGGT CCCGGGAAGG CCACCTGCCC 1080 

TCCATCCTCT CCATGATCCA CCCACAGCTC CTCACCCCCG TGCCGTCCCT CGTGTTCACG 1140 

TGTGTGATGA CGCTGCTCTA CGCCTTCTCC AAGGACATCT TCTCCGTCAT CAACTTCTTC 1200 

10 AGCTTCTTCA ACTGGCTCTG CGTGGCCCTG GCCATCATCG GCATGATCTG GCTGCGCCAC 1260 

AGAAAGCCTG AGCTTGAGCG GCCCATCAAG GTGAACCTGG CCCTGCCTGT GTTCTTCATC 1320 

CTGGCCTGCC TCTTCCTGAT CGCCGTCTCC TTCTGGAAGA CACCCGTGGA GTGTGGCATC 1380 

GGCTTCACCA TCATCCTCAG CGGGCTGCCC GTCTACTTCT TCGGGGTCTG GTGGAAAAAC 1440 

AAGCCCAAGT GGCTCCTCCA GGGCATCTTC TCCACGACCG TCCTGTGTCA GAAGCTCATG 1500 

15 CAGGTGGTCC CCCAGGAGAC ATAG 

Seq ID NO: 183 Protein Sequence 
Protein Accession ft: XP_035292.2 

20 1 11 21 31 41 51 

i I I I I I 

MAGAGPKRRA LAAPAAEEKE EAREKMLAAK SADGSAPAGE GEGVTLQRNI TLLNGVAI IV 60 

GTIIGSGIFV TPTGVLKEAG SPGLALWWA ACGVPSIVGA LCYAELGTTI SKSGGDYAYM 120 

LEVYGSLPAF LKLWIELLII RPSSQYIVAL VFATYLLKPL PPTCPVPEEA AKLVACLCVL 180 

25 LLTAVNCYSV KAATRVQDAF AAAKLLALAL IILLGPVQIG KGDVSNLDPN FSFEGTKU3V 240 

GNIVLALYSG LFAYGGWNYL NFVTEEMINP YRNLPLAIII SLPIVTLVYV LTNLAYFTTIi 300 

STEQMLSSEA VAVDFGNYHL GVMSWIIPVF VGLSCFGSVN GSLFTSSRLF FVGSREGHLP 360 

SILSMIHPQL LTPVPSLVFT CVMTLLYAFS KDIFSVINFF SFFNWLCVAL AIIGMIWLRH 420 

RXPELERPIK VNLALPVFPI LACLFLIAVS FWKTPVECGI GFTIILSGLP VYFFGVWWKN 480 

30 KPKWL»LQGIF STTVLCQKLM QWPQET 



35 



55 



Seq ID NO: 184 DNA Sequence 

Nucleic Acid Accession #: NMJJ05268.1 

Coding sequence: 1..B22 



1 11 21 31 41 51 

t I I I I I 

ATGAACTGGA GTATCTTTGA GGGACTCCTG AGTGGGGTCA ACAAGTACTC CACAGCCTTT 60 

GGGCGCATCT GGCTGTCTCT GGTCTTCATC TTCCGCGTGC TGGTGTACCT GGTGACGGCC 120 

40 GAGCGTGTGT GGAGTGATGA CCACAAGGAC TTCGACTGCA ATACTCGCCA GCCCGGCTGC 180 

TCCAACGTCT GCTTTGATGA GTTCTTCCCT GTGTCCCATG TGCGCCTCTG GGCCCTGCAG 240 

CTTATCCTGG TGACATGCCC CTCACTGCTC GTGGTCATGC ACGTGGCCTA CCGGGAGGTT 300 

CAGGAGAAGA GGCACCGAGA AGCCCATGGG GAGAACAGTG GGCGCCTCTA CCTGAACCCC 360 

GGCAAGAAGC GGGGTGGGCT CTGGTGGACA TATGTCTGCA GCCTAGTGTT CAAGGCGAGC 420 

45 GTGGACATCG CCTTTCTCTA TGTGTTCCAC TCATTCTACC CCAAATATAT CCTCCCTCCT 480 

GTGGTCAAGT GCCACGCAGA TCCATGTCCC AATATAGTGG ACTGCTTCAT CTCCAAGCCC 540 

TCAGAGAAGA ACATTTTCAC CCTCTTCATG GTGGCCACAG CTGCCATCTG CATCCTGCTC 600 

AACCTCGTGG AGCTCATCTA CCTGGTGAGC AAGAGATGCC ACGAGTGCCT GGCAGCAAGG 660 

AAAGCTCAAG CCATGTGCAC AGGTCATCAC CCCCACGGTA CCACCTCTTC CTGCAAACAA 720 

50 GACGACCTCC TTTCGGGTGA CCTCATCTTT CTGGGCTCAG ACAGTCATCC TCCTCTCTTA 780 
CCAGACCGCC CCCGAGACCA TGTGAAGAAA ACCATCTTGT GA 



Seq ID NO: 185 Protein Sequence 
Protein Accession #: NP_005259.l 



1 11 21 31 41 51 

I I I I I I 

MNWSIFEGLL SGVNKYSTAF GRIWLSLVFI FRVLVYLVTA ERVWSDDHKD FDCNTRQPGC 60 

SNVCFDEFFP VSHVRLWALQ LILVTCPSLL WMHVAYREV QEKRHREAHG ENSGRLYLNP 120 

60 GKKRGGLWWT YVCSLVFKAS VDIAFLYVFH SFYPKYILPP WKCHADPCP NIVDCFISKP 180 

SEKNIFTLFM VATAAICILL NLVELIYLVS KRCHECLAAR KAQAMCTGHH PHGTTSSCKQ 240 
DDLLSGDLIF LGSDSHPPLL PDRPRDHVKK TIL 

Seq ID NO: 1B6 DNA Sequence 
65 Nucleic Acid Accession #: NM_002391.l 
Coding sequence : 2 S . . 4 5 7 

1 11 21 31 41 51 

■7A I I ! 1 1 1 

70 CGGGCGAAGC AGCGOGGGCA GCGAGATGCA GCACCGAGGC TTCCTCCTCC TCACCCTCCT 60 

CGCCCTGCTG GCGCTCACCT CCGCGGTCGC CAAAAAGAAA GATAAGGTGA AGAAGGGCGG 120 

CCCGGGGAGC GAGTGCGCTG AGTGGGCCTG GGGGCCCTGC ACCCCCAGCA GCAAGGATTG 180 

CGGCGTGGGT TTCCGCGAGG GCACCTGCGG GGCCCAGACC CAGCGCATCC GGTGCAGGGT 240 

GCCCTGCAAC TGGAAGAAGG AGTTTGGAGC CGACTGCAAG TACAAGTTTG AGAACTGGGG 300 

75 TGCGTGTGAT GGGGGCACAG GCACCAAAGT CCGCCAAGGC ACCCTGAAGA AGGCGCGCTA 360 

CAATGCTCAG TGCCAGGAGA CCATCCGCGT CACCAAGCCC TGCACCCCCA AGACCAAAGC 420 

AAAGGCCAAA GCCAAGAAAG GGAAGGGAAA GGACTAGACG CCAAGCCTGG ATGCCAAGGA 480 

GCCCCTGGTG TCACATGGGG CCTGGCCACG CCCTCCCTCT CCCAGGCCCG AGATGTGACC 540 
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CACCAGTGCC TTCTGTCTGC TCGTTAGCTT TAATCAATCA TGCCCTGCCT TGTCCCTCTC 600 

ACTCCCCAGC CCCACCCCTA AGTGCCCAAA GTGGGGAGGG ACAAGGGATT CTGGGAAGCT 660 

TGAGCCTCCC CCAAAGCAAT GTGAGTCCCA GAGCCCGCTT TTGTTCTTCC CCACAATTCC 720 

ATTACTAAGA AACACATCAA ATAAACTGAC TTTTTCCCCC CAATAAAAGC TCTTCTTTTT 780 
TAATAT 

Seq ID NO: 187 Protein Sequence 
Protein Accession 8: NP_002382.1 

1 11 21 31 41 51 

I I I I I I 

MQHRGFLLLT LLALLALTSA VAKKKDKVKK GGPGSECAEW AWGPCTPSSK DCGVGFREGT 60 

CGAQTQRIRC RVPCNWKKEF GADCKYKFEN WGACDGGTGT KVRQGTLKKA RYNAQCQETI 120 
RVTKPCTPKT KAKAKAKKGK GKD 

Seq ID NO: 188 DNA Sequence 
Nucleic Acid Accession #: NMJD15507 
Coding sequence: 241-1902 

1 11 21 31 41 51 

I I I I I I 

CCGCAGAGGA GCCTCGGCCA GGCTAGCCAG GGCGCCCCCA GCCCCTCCCC AGGCCGCGAG 60 

CGCCCCTGCC GCGGTGCCTG GCCTCCCCTC CCAGACTGCA GGGACAGCAC CCGGTAACTG 120 

OGAGTGGAGC GGAGGACCCG AGCGGCTGAG GAGAGAGGAG GCGGCGGCTT AGCTGCTACG 180 

GGGTCCGGCC GGCGCCCTCC CGAGGGGGGC TCAGGAGGAG GAAGGAGGAC CCGTGCGAGA 240 

ATGCCTCTGC CCTGGAGCCT TGCGCTCCCG CTGCTGCTCT CCTGGGTGGC AGGTGGTTTC 300 

GGGAACGOGG CCAGTGCAAG GCATCACGGG TTGTTAGCAT CGGCACGTCA GCCTGGGGTC 360 

TGTCACTATG GAACTAAACT GGCCTGCTGC TACGGCTGGA GAAGAAACAG CAAGGGAGTC 420 

TGTGAAGCTA CATGCGAACC TGGATGTAAG TTTGGTGAGT GCGTGGGACC AAACAAATGC 480 

AGATGCTTTC CAGGATACAC CGGGAAAACC TGCAGTCAAG ATGTGAATGA GTGTGGAATG 540 

AAACCCCGGC CATGCCAACA CAGATGTGTG AATACACACG GAAGCTACAA GTGCTTTTGC 600 

CTCAGTGGCC ACATGCTCAT GCCAGATGCT ACGTGTGTGA ACTCTAGGAC ATGTGCCATG 660 

ATAAACTGTC AGTACAGCTG TGAAGACACA GAAGAAGGGC CACAGTGCCT GTGTCCATCC 720 

TCAGGACTCC GCCTGGCCCC AAATGGAAGA GACTGTCTAG ATATTGATGA ATGTGCCTCT 780 

GGTAAAGTCA TCTGTCCCTA CAATCGAAGA TGTGTGAACA CATTTGGAAG CTACTACTGC 840 

AAATGTCACA TTGG7TTCGA ACTGCAATAT ATCAGTGGAC GATATGACTG TATAGATATA 900 

AATGAATGTA CTATGGATAG CCATACGTGC AGCCACCATG CCAATTGCTT CAATACCCAA 960 

GGGTCCTTCA AGTGTAAATG CAAGCAGGGA TATAAAGGCA ATGGACTTCG GTGTTCTGCT 1020 

ATCCCTGAAA ATTCTGTGAA GGAAGTCCTC AGAGCACCTG GTACCATCAA AGACAGAATC 1080 

AAGAAGTTGC TTGCTCACAA AAACAGCATG AAAAAGAAGG CAAAAATTAA AAATGTTACC 1140 

CCAGAACCCA CCAGGACTCC TACCCCTAAG GTGAACTTGC AGCCCTTCAA CTATGAAGAG 1200 

ATAGTTTCCA GAGGCGGGAA CTCTCATGGA GGTAAAAAAG GGAATGAAGA GAAAATGAAA 1260 

GAGGGGCTTG AGGATGAGAA AAGAGAAGAG AAAGCCCTGA AGAATGACAT AGAGGAGCGA 1320 

AGCCTGCGAG GAGATGTGTT TTTCCCTAAG GTGAATGAAG CAGGTGAATT CGGCCTQATT 1380 

CTGGTCCAAA GGAAAGCGCT AACTTCCAAA CTGGAACATA AAGATTTAAA TATCTCGGTT 1440 

GACTGCAGCT TCAATCATGG GATCTGTGAC TGGAAACAGG ATAGAGAAGA TGATTTTGAC 1500 

TGGAATCCTG CTGATCGAGA TAATGCTATT GGCTTCTATA TGGCAGTTCC GGCCTTGGCA 1560 

GGTCACAAGA AAGACATTGG CCGATTGAAA CTTCTCCTAC CTGACCTGCA ACCCCAAAGC 1620 

AACTTCTGTT TGCTCTTTGA TTACCGGCTG GCCGGAGACA AAGTCGGGAA ACTTCGAGTG 1680 

TTTGTGAAAA ACAGTAACAA TGCCCTGGCA TGGGAGAAGA CCACGAGTGA GGATGAAAAG 1740 

TGGAAGACAG GGAAAATTCA GTTGTATCAA GGAACTGATG CTACCAAAAG CATCATTTTT 1800 

GAAGCAGAAC GTGGCAAGGG CAAAACCGGC GAAATCGCAG TGGATGGCGT CTTGCTTGTT 1860 

TCAGGCTTAT GTCCAGATAG CCTTTTATCT GTGGATGACT GAATGTTACT ATCTTTATAT 1920 

TTGACTTTGT ATGTCAGTTC CCTGGTTTTT TTGATATTGC ATCATAGGAC CTCTGGCATT 1980 

TTAGAATTAC TAGCTGAAAA ATTGTAATGT ACCAACAGAA ATATTATTGT AAGATGCCTT 2040 

TCTTGTATAA GATATGCCAA TATTTGCTTT AAATATCATA TCACTGTATC TTCTCAGTCA 2100 

TTTCTGAATC TTTCCACATT ATATTATAAA ATATGGAAAT GTCAGTTTAT CTCCCCTCCT 2160 

CAGTATATCT GATTTGTATA AGTAAGTTGA TGAGCTTCTC TCTACAACAT TTCTAGAAAA 2220 

TAGAAAAAAA AGCACAGAGA AATGTTTAAC TGTTTGACTC TTATGATACT TCTTGGAAAC 2280 

TATGACATCA AAGATAGACT TTTGCCTAAG TGGCTTAGCT GGGICTTTCA TAGCCAAACT 2340 
TGTATATTTA AATTCTTTGT AATAATAATA TCCAAATCAT CAAAAAAAAA AAAAAAAA 

Seq ID NOs 189 Protein Sequence 
Protein Accession 8: NP_056322.2 

1 11 21 31 41 51 

I I I I 1 I 

MPLFWSLALP LLLSWVAGGP GNAASARHHG LLASARQPGV CHYGTKIiACC YGWRRNSKGV 60 

CEATCEPGCK FGECVGPNKC RCFPGYTGKT CSQDVNECGM KPRPCQHRCV NTHGSYKCFC 120 

LSQHMLMPDA TCVNSRTCAM INCQYSCEDT EEGPQCLCPS SGLRLAPNGR DCLDIDECAS 180 

GKVICPYNRR CVNTFGSYYC KCHIGFELQY ISGRYDCIDI NECTMDSHTC SHHANCFNTQ 240 

GSFKCKCKQG YKGNGLRCSA IPENSVKEVL RAPGTIKDRI KKLLAHKNSM KKKAKIKNVT 300 

PBPTRTPTPK VNLOPFNYEE IVSRGGNSHG GKKGNBEKMK EGLEDEKREE KALKNDIEER 360 

SLRGDVFPPK VNEAGBFGLI LVQRKALTSK LEHKDLNISV DCSFNHGICD WKQDREDDFD 420 

WNPADRDNAI GFYMAVPALA GHKKDIGRLK LLLPDLQPQS NFCLLFDYRL AGDKVGKLRV 480 

FVKNSNNALA WEKTTSEDEK WKTGKIQLYQ GTDATKSIIF EAERGKGKTG EIAVDGVLLV 540 
SGLCPDSLIiS VDD 
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Seq ID NO: 190 DNA Sequence 
Nucleic Acid Accession #: NM_006475 
Coding sequence: 12.. 2522 



1 11 21 31 41 51 

I 1 I I I I 

AGAGACTCAA GATGATTCCC TTTTTACCCA TGTTTTCTCT ACTATTGCTG CTTATTGTTA 60 

ACCCTATAAA CGCCAACAAT CATTATGACA AQATCTTGGC TCATAGTCGT ATCAGGGGTC 120 

GGGACCAAGG CCCAAATGTC TGTGCCCTTC AACAGATTTT GGGCACCAAA AAGAAATACT 180 

TCAGCACTTG TAAGAACTGG TATAAAAAGT CCATCTGTGG ACAGAAAACG ACTGTTTTAT 240 

ATGAATGTTG CCCTGGTTAT ATGAGAATGG AAGGAATGAA AGGCTGCCCA GCAGTTTTGC 300 

CCATTGACCA TGTTTATGGC ACTCTGGGCA TCGTGGGAGC CACCACAACG CAGCGCTATT 3 60 

CTGACGCCTC AAAACTGAGG GAGGAGATCG AGGGAAAGGG ATCCTTCACT TACTTTGCAC 420 

CGAGTAATGA GGCTTGGGAC AACTTGGATT CTGATATCCG TAGAGGTTTG GAGAGCAACG 480 

TGAATGTTGA ATTACTGAAT GCTTTACATA GTCACATGAT TAATAAGAGA ATGTTGACCA 540 

AGGACTTAAA AAATGGCATG ATTATTCCTT CAATGTATAA CAATTTGGGG CTTTTCATTA 600 

ACCATTATCC TAATGGGGTT GTCACTGTTA ATTGTGCTCG AATCATCCAT GGGAACCAGA 660 

TTGCAACAAA TGGTGTTGTC CATGTCATTG ACCGTGTGCT TACACAAATT GGTACCTCAA 720 

TTCAAGACTT CATTGAAGCA GAAGATGACC TTTCATCTTT TAGAGCAGCT GCCATCACAT 780 

CGGACATATT GGAGGCCCTT GGAAGAGACG GTCACTTCAC ACTCTTTGCT CCCACCAATG 840 

AGGCTTTTGA GAAACTTCCA CGAGGTGTCC TAGAAAGGTT CATGGGAGAC AAAGTGGCTT 900 

CCGAAGCTCT TATGAAGTAC CACATCTTAA ATACTCTCCA GTGTTCTGAG TCTATTATGG 960 

GAGGAGCAGT CTTTGAGACG CTGGAAGGAA ATACAATTGA GATAGGATGT GACGGTGACA 1020 

GTATAACAGT AAATGGAATC AAAATGGTGA ACAAAAAGGA TATTGTGACA AATAATGGTG 1080 

TGATCCATTT GATTGATCAG GTCCTAATTC CTGATTCTGC CAAACAAGTT ATTGAGCTGG 1140 

CTGGAAAACA GCAAACCACC TTCACGGATC TTGTGGCCCA ATTAGGCTTG GCATCTGCTC 1200 

TGAGGCCAGA TGGAGAATAC ACTTTGCTGG CACCTGTGAA TAATGCATTT TCTGATGATA 1260 

CTCTCAGCAT GGTTCAGCGC CTCCTTAAAT TAATTCTGCA GAATCACATA TTGAAAGTAA 1320 

AAGTTGGCCT TAATGAGCTT TACAACGGGC AAATACTGGA AACCATCGGA GGCAAACAGC 1380 

TCAGAGTCTT CGTATATCGT ACAGCTGTCT GCATTGAAAA TTCATGCATG GAGAAAGGGA 1440 

GTAAGCAAGG GAGAAACGGT GCGATTCACA TATTCCGCGA GATCATCAAG CCAGCAGAGA 1500 

AATCCCTCCA TGAAAAGTTA AAACAAGATA AGCGCTTTAG CACCTTCCTC AGCCTACTTG 1560 

AAGCTGCAGA CTTGAAAGAG CTCCTGACAC AACCTGGAGA CTGGACATTA TTTGTGCCAA 1620 

CCAATGATGC TTTTAAGGGA ATGACTAGTG AAGAAAAAGA AATTCTGATA CGGGACAAAA 1680 

ATGCTCTTCA AAACATCATT CTTTATCACC TGACACCAGG AGTTTTCATT GGAAAAGGAT 1740 

TTGAACCTGG TGTTACTAAC ATTTTAAAGA CCACACAAGG AAGCAAAATC TTTCTGAAAG 1800 

AAGTAAATGA TACACTTCTG GTGAATGAAT TGAAATCAAA AGAATCTGAC ATCATGACAA 1860 

CAAATGGTGT AATTCATGTT GTAGATAAAC TCCTCTATCC AGCAGACACA CCTGTTGGAA 1920 

ATGATCAACT GCTGGAAATA CTTAATAAAT TAATCAAATA CATCCAAATT AAGTTTGTTC 1980 

GTGGTAGCAC CTTCAAAGAA ATCCCCGTGA CTGTCTATAC AACTAAAATT ATAACCAAAG 2040 

TTGTGGAACC AAAAATTAAA GTGATTGAAG GCAGTCTTCA GCCTATTATC AAAACTGAAG 2100 

GACCCACACT AACAAAAGTC AAAATTGAAG GTGAACCTGA ATTCAGACTG ATTAAAGAAG 2160 

GTGAAACAAT AACTGAAGTG ATCCATGGAG AGCCAATTAT TAAAAAATAC ACCAAAATCA 2220 

TTGATGGAGT GCCTGTGGAA ATAACTGAAA AAGAGACACG AGAAGAACGA ATCATTACAG 2280 

GTCCTGAAAT AAAATACACT AGGATTTCTA CTGGAGGTGG AGAAACAGAA GAAACTCTGA 2340 

AGAAATTGTT ACAAGAAGAG GTCACCAAGG TCACCAAATT CATTGAAGGT GGTGATGGTC 2400 

ATTTATTTGA AGATGAAGAA ATTAAAAGAC TGCTTCAGGG AGACACACCC GTGAGGAAGT 2460 

TGCAAGCCAA CAAAAAAGTT CAAGGTTCTA GAAGACGATT AAGGGAAGGT CGTTCTCAGT 2520 

GAAAATCCAA AAACCAGAAA AAAATGTTTA TACAACCCTA AGTCAATAAC CTGACCTTAG 2580 

AAAATTGTGA GAGCCAAGTT GACTTCAGGA ACTGAAACAT CAGCACAAAG AAGCAATCAT 2640 

CAAATAATTC TGAACACAAA TTTAATATTT TTTTTTCTGA ATGAGAAACA TGAGGGAAAT 2700 

TGTGGAGTTA GCCTCCTGTG GTAAAGGAAT TGAAGAAAAT ATAACACCTT ACACCCTTTT 2760 

TCATCTTGAC ATTAAAAGTT CTGGCTAACT TTGGAATCCA TTAGAGAAAA ATCCTTGTCA 2820 

CCAGATTCAT TACAATTCAA ATCGAAGAGT TGTGAACTGT TATCCCATTG AAAAGACCGA 2880 

GCCTTGTATG TATGTTATGG ATACATAAAA TGCACGCAAG CCATTATCTC TCCATGGGAA 2940 

GCTAAGTTAT AAAAATAGGT GCTTGGTGTA CAAAACTTTT TATATCAAAA GGCTTTGCAC 3000 

ATTTCTATAT GAGTGGGTTT ACTGGTAAAT TATGTTATTT TTTACAACTA ATTTTGTACT 3060 

CTCAGAATGT TTGTCATATG CTTCTTGCAA TGCATATTTT TTAATCTCAA ACGTTTCAAT 3120 

AAAACCATTT TTCAGATATA AAGAGAATTA CTTCAAATTG AGTAATTCAG AAAAACTCAA 3180 
GATTTAAGTT AAAAAGTGGT TTGGACTTGG GAA 



Seq ID NO: 191 Protein Sequence 
Protein Accession #t BAA02836.1 



1 11 21 31 41 51 

I I I I I I 

MIPFLPMFSL LLLLIVNPIN ANNHYDKILA HSRIRGRDQG PNVCALQQIL GTKKKYFSTC 60 

KNWYKKSICG QKTTVLYECC PGYMRMEGMK GCPAVLPIDH VYGTLGIVGA TTTQRYSDAS 120 

KbREEIEGKG SFTYFAPSNE AWDNLDSDIR RGLESNVNVE LLNALHSHMI NKRMLTKDLK 180 

NGMIIPSMYN NLGLFINHYP NGWTVNCAR IIHGNQIATN GWHVIDRVL TQIGTSIQDF 240 

IEAEDDLSSF RAAAITSDIL EALGRDGHFT LFAPTNEAFE KLPRGVLERF MGDKVASEAL 300 

MKYHILNTLQ CSESIMGGAV FETLEGNTIE IGCDGDSITV NGIKMVNKKD IVTNNGVIHL 360 

IDQVLIPDSA KQVIELAGKQ QTTFTDLVAQ LGLASALRPD GEYTLLAPVN NAFSDDTLSM 420 

VQRLLKLILQ NHILKVKVGL NELYNGQILE TIGGKQLRVF VYRTAVCIEN SCMEKGSKQG 480 

RNGAIHIFRE IIKPAEKSLH EKLKQDKRFS TFLSLLEAAD LKELLTQPGD WTLFVPTNDA 540 

FKGMTSBEKE ILIRDKNALQ NIILYHLTPG VFIGKGPEPG VTNILKTTQG SKIFLKEVND 600 

TLLVNELKSK ESDIMTTNGV IHWDKLLYP ADTPVGNDQL LEILNKLIKY IQIKFVRGST 660 
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PKEIPVTVYT TKIITKWEP KIKVIEGSLQ PIIKTEGPTL TKVKIEGEPE PRLIKEGETI 720 

TEVIHGEPII KKYTKIIDGV PVEITEKETR EERIITGPEI KYTRISTGGG ETEETLKKLL 780 
QBEVTKVTKF IEGGDGHLFE DEEIKRLLQG DTPVRKLQAN KKVQGSRRRL REGRSQ 

5 Seq ID NO: 192 DNA Sequence 

Nucleic Acid Accession fti NM_006670 
Coding sequence: 85.. 1347 

1 11 21 31 41 51 

10 | | i 111 

CCGGCTCGCG CCCTCCGGGC CCAGCCTCCC GAGCCTTCGG AGCGGGCGCC GTCCCAGCCC 60 

AGCTCCGGGG AAACGCGAGC CGCGATGCCT GGGGGGTGCT CCCGGGGCCC CGCCGCCGGG 120 

GACGGGCGTC TGCGGCTGGC GCGACTAGCG CTGGTACTCC TGGGCTGGGT CTCCTCGTCT 180 

TCTCCCACCT CCTCGGCATC CTCCTTCTCC TCCTCGGCGC CGTTCCTGGC TTCCGCCGTG 240 

15 TCCGCCCAGC CCCCGCTGCC GGACCAGTGC CCCGCGCTGT GCGAGTGCTC CGAGGCAGCG 300 

CGCACAGTCA AGTGCGTTAA CCGCAATCTG ACCGAGGTGC CCACGGACCT GCCCGCCTAC 3 60 

GTGCGCAACC TCTTCCTTAC CGGCAACCAG CTGGCCGTGC TCCCTGCCGG CGCCTTCGCC 420 

CGCCGGCCGC CGCTGGCGGA GCTGGCCGCG CTCAACCTCA GCGGCAGCCG CCTGGACGAG 480 

GTGCGCGCGG GCGCCTTCGA GCATCTGCCC AGCCTGCGCC AGCTCGACCT CAGCCACAAC 540 

20 CCACTGGCCG ACCTCAGTCC CTTCGCTTTC TOGGGCAGCA ATGCCAGCGT CTOGGCCCCC 600 

AGTCCCCTTG TGGAACTGAT CCTGAACCAC ATCGTGCCCC CTGAAGATGA GCGGCAGAAC 660 

CGGAGCTTCG AGGGCATGGT GGTGGCGGCC CTGCTGGCGG GCCGTGCACT GCAGGGGCTC 720 

CGCCGCTTGG AGCTGGCCAG CAACCACTTC CTTTACCTGC CGCGGGATGT GCTGGCCCAA 780 

CTGCCCAGCC TCAGGCACCT GGACTTAAGT AATAATTCGC TGGTGAGCCT GACCTACGTG 840 

25 TCCTTCCGCA ACCTGACACA TCTAGAAAGC CTCCACCTGG AGGACAATGC CCTCAAGGTC 900 

CTTCACAATG GCACCCTGGC TGAGTTGCAA GGTCTACCCC ACATTAGGGT TTTCCTGGAC 960 

AACAATCCCT GGGTCTGCGA CTGCCACATG GCAGACATGG TGACCTGGCT CAAGGAAACA 1020 

GAGGTAGTGC AGGGCAAAGA CCGGCTCACC TGTGCATATC CGGAAAAAAT GAGGAATCGG 1080 

GTCCTCTTGG AACTCAACAG TGCTGACCTG GACTGTGACC CGATTCTTCC CCCATCCCTG 1140 

30 CAAACCTCTT ATGTCTTCCT GGGTATTGTT TTAGCCCTGA TAGGCGCTAT TTTCCTCCTG 1200 

GTTTTGTATT TGAACCGCAA GGGGATAAAA AAGTGGATGC ATAACATCAG AGATGCCTGC 1260 

AGGGATCACA TGGAAGGGTA TCATTACAGA TATGAAATCA ATGCGGACCC CAGATTAACA 1320 

AACCTCAGTT CTAACTCGGA TGTCTGAGAA ATATTAGAGG ACAGACCAAG GACAACTCTG 1380 

CATGAGATGT AGACTTAAGC TTTATCCCTA CTAGGCTTGC TCCACTTTCA TCCTCCACTA 1440 

35 TAGATACAAC GGACTTTGAC TAAAAGCAGT GAAGGGGATT TGCTTCCTTG TTATGTAAAG 1500 

TTTCTCGGTG TGTTCTGTTA ATGTAAGACG ATGAACAGTT GTGTATAGTG TTTTACCCTC 1560 

TTCTTTTTCT TGGAACTCCT CAACACGTAT GGAGGGATTT TTCAGGTTTC AGCATGAACA 1620 

TGGGCTTCTT GCTGTCTGTC TCTCTCTCAG TACAGTTCAA GGTGTAGCAA GTGTACCCAC 1680 

ACAGATAGCA TTCAACAAAA GCTGCCTCAA CTTTTTCGAG AAAAATACTT TATTCATAAA 1740 

40 TATCAGTTTT ATTCTCATGT ACCTAAGTTG TGGAGAAAAT AATTGCATCC TATAAACTGC 1800 

CTGCAGACGT TAGCAGGCTC TTCAAAATAA CTCCATGGTG CACAGGAGCA CCTGCATCCA 1860 

AGAGCATGCT TACATTTTAC TGTTCTGCAT ATTACAAAAA ATAACTTGCA ACTTCATAAC 1920 

TTCTTTGACA AAGTAAATTA CTTTTTTGAT TGCAGTTTAT ATGAAAATGT ACTGATTTTT 1980 

TTTTAATAAA CTGCATCGAG ATCCAACCGA CTGAATTGTT AAAAAAAAAA AAAAATAAAG 2040 
ATTCTTAAAA GAA 

Seq ID HO: 193 Protein Sequence 
Protein Accession #i CAA82324.1 

50 1 11 21 31 41 51 

1 I I 1 1 I 

MPGGCSRGPA AGDGRLRLAR LALVLLGWVS SSSPTSSASS FSSSAPPIAS AVSAQPPLPD 60 

QCPALCECSE AARTVKCVNR NLTEVPTDLP AYVRNLFLTG NQLAVLPAGA FARRPPLAEL "120 

AALNLSGSRL DEVRAGAFEH LPSLRQLDLS HNPLADLSPP AFSGSNASVS APSPLVELIL 180 

55 NHIVPPEDER QNRSFEGMW AALLAGRALQ GLRRLELASN HPLYLPRDVL AQLPSLRHLD 240 

LSNKSLVSLT YVSFRNLTHL ESLHLEDNAL KVLHNGTLAE LQGLPHIRVF LDNNPWVCDC 300 

HMADMVTWLK ETEWQGKDR LTCAYPEKMR NRVLLELNSA DLDCDPILPP SLQTSYVFLG 360 
IVLALIGAIF LLVLYLNRKG IKKWMHHIRD ACRDHMEGYH YRYEINADPR LTNLSSNSDV 

60 Seq ID NO: 194 DNA Sequence 

Nucleic Acid Accession 8: NM_014400 
Coding sequence: 86.. 1126 

65 1 11 21 31 41 51 

I I 1 I I I 

GGTTACTCAT CCTGGGCTCA GGTAAGAGGG CCCGAGCTCG GAGGCGGCAC ACCCAGGGGG 60 

GACGCCAAGG GAGCAGGACG GAGCCATGGA CCCCGCCAGG AAAGCAGGTG CCCAGGCCAT 120 

GATCTGGACT GCAGGCTGGC TGCTGCTGCT GCTGCTTCGC GGAGGAGCGC AGGCCCTGGA 180 

70 GTGCTACAGC TGCGTGCAGA AAGCAGATGA CGGATGCTCC CCGAACAAGA TGAAGACAGT 240 

GAAGTGCGCq CCGGGCGTGG ACGTCTGCAC CGAGGCCGTG GGGGCGGTGG AGACCATCCA 300 

CGGACAATTC TCGCTGGCAG TGCSGGGTTG CGGTTCGGGA CTCCCCGGCA AGAATGACCG 360 

CGGCCTGGAT CTTCACGGGC TTCTGGCGTT CATCCAGCTG CAGCAATGCG CTCAGGATCG 420 

CTGCAACGCC AAGCTCAACC TCACCTCGCG GGCGCTCGAC CCGGCAGGTA ATGAGAGTGC 480 

75 ATACCCGCCC AACGGCGTGG AGTGCTACAG CTGTGTGGGC CTGAGCCGGG AGGCGTGCCA 540 

GGGTACATCG CCGCCGGTCG TGAGCTGCTA CAACGCCAGC GATCATGTCT ACAAGGGCTG 600 

CTTGGACGGC AACGTCACCT TGACGGCAGC TAATGTGACT GTGTCCTTGC CTGTCCGGGG 660 

CTGTGTCCAG GATGAATTCT GCACTCGGGA TGGAGTAACA GGCCCAGGGT TCACGCTCAG 720 
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TGGCTCCTGT TGCCAGGGGT CCCGCTGTAA CTCTGACCTC CGCAACAAGA CCTACTTCTC 780 

CCCTCGAATC CCACCCCTTG TCCGGCTGCC CCCTCCAGAG CCCACGACTG TGGCCTCAAC 840 

CACATCTGTC ACCACTTCTA CCTCGCCCCC AGTGAGACCC ACATCCACCA CCAAACCCAT 900 

GCCAGCGCCA ACCAGTCAGA CTCCGAGACA GGGAGTAGAA CACGAGGCCT CCCGGGATGA 960 

5 GGAGCCCAGG TTGACTGGAG GCGCCGCTGG CCACCAGGAC CGCAGCAATT CAGGGCAGTA 1020 

TCCTGCAAAA GGGGGGCCCC AGCAGCCCCA TAATAAAGGC TGTGTGGCTC CCACAGCTGG 1080 

ATTGGCAGCC CTTCTGTTGG CCGTGGCTGC TGGTGTCCTA CTGTGAGCTT CTCCACCTGG 1140 

AAATTTCCCT CTCACCTACT TCTCTGGCCC TGGGTACCCC TCTTCTCATC ACTTCCTGTT 1200 

CCCACCACTG GACTGGGCTG GCCCAGCCCC TGTTTTTCCA ACATTCCCCA GTATCCCCAG 1260 

10 CTTCTGCTGC GCTGGTTTGC GGCTTTGGGA AATAAAATAC CGTTGTATAT ATTCTGGCAG 1320 

GGGTGTTCTA GCTTTTTGAG GACAGCTCCT GTATCCTTCT CATCCTTGTC TCTCCGCTTG 1380 

TCCTCTTGTG ATGTTAGGAC AGAGTGAGAG AAGTCAGCTG TCACGGGGAA GGTGAGAGAG 1440 

AGGATGCTAA GCTTCCTACT CACTTTCTCC TAGCCAGCCT GGACTTTGGA GCGTGGGGTG 1500 

GGTGGGACAA TGGCTCCCCA CTCTAAGCAC TGCCTCCCCT ACTCCCCGCA TCTTTGGGGA 1560 

15 ATCGGTTCCC CATATGTCTT CCTTACTAGA CTGTGAGCTC CTCGAGGGCA GGGACCGTGC 1620 

CTTATGTCTG TGTGTGATCA GTTTCTGGCA CATAAATGCC TCAATAAAGA TTTAATTACT 1680 
TTGTATAGTG AAAAAAAA 

Seq ID NO: 195 Protein Sequence 
20 Protein Accession 8: NPJJ55215 

1 11 21 31 41 51 

I I I I I I 

MDPARKAGAQ AMIWTAGWLL LLLLRGGAQA LECYSCVQKA DDGCSPNKMK TVKCAPGVDV 60 

25 CTEAVGAVET IHGQFSLAVX GCGSGLPGKN DRGIjDLHGLL AFIQLQQCAQ DRCNAKLNLT 120 

SRALDPAGNE SAYPPNGVEC YSCVGLSREA CQGTSPPWS CYNASDHVYX GCPDGNVTLT 180 

AANVTVSLPV RGCVQDEFCT RDGVTGPGPT LSGSCCQGSR CNSDLRNKTY FSPRIPPLVR 240 

LPPPEPrrVA STTSVTTSTS APVRPTSTTK PMPAPTSQTP RQGVEHEASR DEEPRLTGGA 300 
AGHQDRSNSG QYPAKGGPQQ PHNKGCVAPT AGLAALLLAV AAGVLL 

Seq ID NO: 196 DNA Sequence 
Nucleic Acid Accession #: NMJJ06536 
Coding sequence: 109.. 2940 *~ 

35 1 11 21 31 41 51 

I I 1 I I I 

ACCTAAAACC TTGCAAGTTC AGGAAGAAAC CATCTGCATC CATATTGAAA ACCTGACACA 60 

ATGTATGCAG CAGGCTCAGT GTGAGTGAAC TGGAGGCTTC TCTACAACAT GACCCAAAGG 120 

AGCATTGCAG GTCCTATTTG CAACCTGAAG TTTGTGACTC TCCTGGTTGC CTTAAGTTCA 180 

40 GAACTCCCAT TCCTGGGAGC TGGAGTACAG CTTCAAGACA ATGGGTATAA TGGATTGCTC 240 

ATTGCAATTA ATCCTCAGGT ACCTGAGAAT CAGAACCTCA TCTCAAACAT TAAGGAAATG 300 

ATAACTGAAG CTTCATTTTA CCTATTTAAT GCTACCAAGA GAAGAGTATT TTTCAGAAAT 360 

ATAAAGATTT TAATACCTGC CACATGGAAA GCTAATAATA ACAGCAAAAT AAAACAAGAA 420 

TCATATGAAA AGGCAAATGT CATAGTGACT GACTGGTATG GGGCACATGG AGATGATCCA 480 

45 TACACCCTAC AATACAGAGG GTGTGGAAAA GAGGGAAAAT ACATTCATTT CACACCTAAT 540 

TTCCTACTGA ATGATAACTT AACAGCTGGC TACGGATCAC GAGGCCGAGT GTTTGTCCAT 600 

GAATGGGCCC ACCTCCGTTG GGGTGTGTTC GATGAGTATA ACAATGACAA ACCTTTCTAC 660 

ATAAATGGGC AAAATCAAAT TAAAGTGACA AGGTGTTCAT CTGACATCAC AGGCATTTTT 720 

GTGTGTGAAA AAGGTCCTTG CCCCCAAGAA AACTGTATTA TTAGTAAGCT TTTTAAAGAA 780 

50 GGATGCACCT TTATCTACAA TAGCACCCAA AATGCAACTG CATCAATAAT GTTCATGCAA 840 

AGTTTATCTT CTGTGGTTGA ATTTTGTAAT GCAAGTACCC ACAACCAAGA AGCACCAAAC 900 

CTACAGAACC AGATGTGCAG CCTCAGAAGT GCATGGGATG TAATCACAGA CTCTGCTGAC 960 

TTTCACCACA GCTTTCCCAT GAATGGGACT GAGCTTCCAC CTCCTCCCAC ATTCTCGCTT 1020 

GTACAGGCTG GTGACAAAGT GGTCTGTTTA GTGCTGGATG TGTCCAGCAA GATGGCAGAG 1080 

55 GCTGACAGAC TCCTTCAACT ACAACAAGCC GCAGAATTTT ATTTGATGCA GATTGTTGAA 1140 

ATTCATACCT TCGTGGGCAT TGCCAQTTTC GACAGCAAAG GAGAGATCAG AGCCCAGCTA 1200 

CACCAAATTA ACAGCAATGA TGATCGAAAG TTGCTGGTTT CATATCTGCC CACCACTGTA 1260 

TCAGCTAAAA CAGACATCAG CATTTGTTCA GGGCTTAAGA AAGGATTTGA GGTGGTTGAA 1320 

AAACTGAATG GAAAAGCTTA TGGCTCTGTG ATGATATTAG TGACCAGCGG AGATGATAAG 1380 

60 CTTCTTGGCA ATTGCTTACC CACTGTGCTC AGCAGTGGTT CAACAATTCA CTCCATTGCC 1440 

CTGGGTTCAT CTGCAGCCCC AAATCTGGAG GAATTATCAC GTCTTACAGG AGGTTTAAAG 1500 

TTCTTTGTTC CAGATATATC AAACTCCAAT AGCATGATTG ATGCTTTCAG TAGAATTTCC 1560 

TCTGGAACTG GAGACATTTT CCAGCAACAT ATTCAGCTTG AAAGTACAGG TGAAAATGTC 1620 

AAACCTCACC ATCAATTGAA AAACACAGTG ACTGTGGATA ATACTGTGGG CAACGACACT 1680 

65 ATGTTTCTAG TTACGTGGCA GGCCAGTGGT CCTCCTGAGA TTATATTATT TGATCCTGAT 1740 

GGACGAAAAT ACTACACAAA TAATTTTATC ACCAATCTAA CTTTTCGGAC AGCTAGTCTT 1800 

TGGATTCCAG GAACAGCTAA GCCTGGGCAC TGGACTTACA CCCTGAACAA TACCCATCAT 1860 

TCTCTGCAAG CCCTGAAAGT GACAGTGACC TCTCGCGCCT CCAACTCAGC TGTGCCCCCA 1920 

A GCCACTGTGG AAGCCTTTGT GGAAAGAGAC AGCCTCCATT TTCCTCATCC TGTGATGATT 1980 

70 TATGCCAATG TGAAACAGGG ATTTTATCCC ATTCTTAATG CCACTGTCAC TGCCACAGTT 2040 

GAGCCAGAGA CTGGAGATCC TGTTACGCTG AGACTCCTTG ATGATGGAGC AGGTGCTGAT 2100 

GTTATAAAAA ATGATGGAAT TTACTCGAGG TATTTTTTCT CCTTTGCTGC AAATGGTAGA 2160 

TATAGCTTGA AAGTGCATGT CAATCACTCT CCCAGCATAA GCACCCCAGC CCACTCTATT 2220 

CCAGGGAGTC ATGCTATGTA TGTACCAGGT TACACAGCAA ACGGTAATAT TCAGATGAAT 2280 

75 GCTCCAAGGA AATCAGTAGG CAGAAATGAG GAGGAGCGAA AGTGGGGCTT TAGCCGAGTC 2340 

AGCTCAGGAG GCTCCTTTTC AGTGCTGGGA GTTCCAGCTG GCCCCCACCC TGATGTGTTT 2400 

CCACCATGCA AAATTATTGA CCTGGAAGCT GTAAAAGTAG AAGAGGAATT GACCCTATCT 2460 

TGGACAGCAC CTGGAGAAGA CTTTGATCAG GGCCAGGCTA CAAGCTATGA AATAAGAATG 2520 
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5 

10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



AGTAAAAGTC 
AAGCGAAATC 
ACGAATGGAC 
GCAATACGAG 
CCTCTGTTTA 
GGAGTTTTAA 
CATACTTTAA 
ATAAATATCC 
CATACTAACA 
ATACAGATAA 
CCTTACACTT 
GCAAAGGGAA 
AATAGCCCCA 
TCATTTAGTT 
TTTACATGAA 
CTTGCTATTT 
TTTCACTGTA 
TTTATGACAA 
TTTCTAAGTT 
TACCTAGGAA 



TACAGAATAT 
CTCAGCAAGC 
CTGAACATCA 
CAATGGATAG 
TTCCCCCCAA 
CAGCAATGGG 
GCAGGAAAAA 
AAAGTGTCTT 
AAGTCAAATT 
GATTTTTACA 
TGGCTATGAA 
GGGTAAAGTC 
AGCAGAGAAA 
ACTTTGATTA 
GATCATGCTA 
TGTTATATAT 
AGAGGTAACC 
AGGTCTATTG 
TATTGCCTTG 
A 



CCAAGATGAC 
TGGCATCAGG 
GCCAAATGGA 
GAACTCCTTA 
TTCTGATCCT 
TTTGATAGGA 
GAGAGCAGAC 
CCTTCTTAGA 
AACATCAAAA 
TGGTAGATCA 
CAAATAATAA 
GGACCAGTGT 
AGGAGGGTAG 
ATTTTTCTTT 
TATTTTATAT 
ATTTCAGATG 
TTTAACAATA 
AATTTATTTG 
GGTTATTATG 



Seq ID NO: 197 Protein Sequence 
Protein Accession #: NP_006S27 



i 

MTQRSIAGPI 
IKEMITEASF 
GDDPYTLQYR 
KPFYINGQNQ 
MFMQSLSSW 
TPSLVQAGDK 
RAQLHQINSN 
GDDKLLGNCL 
SRISSGTGDI 
FDPDGRKYYT 
AVPPATVEAF 
AGADVIKNDG 
IQMNAPRKSV 
LTLSWTAPGE 
PQISTNGPEH 
LILKGVLTAM 



11 
I 

CNLKFVTLLV 
YLFNATKRRV 
GCGKEGKYIH 
IKVTRCSSDI 
EFCNASTHNQ 
WCLVLDVSS 
DDRXLLVSYL 
PTVLSSGSTI 
PQQHIQLEST 
NNFITNLTFR 
VERDSLHFPH 
IYSRYFFSFA 
GRNEEERKWG 
DFDQGQATSY 
QPNGETHESH 
GLIGIICLII 



21 
I 

ALSSELPFLG 
FFRNIKILIP 
FTPNFLLNDN 
TGIFVCEKGP 
EAPNLQNQMC 
KMAEADRLLQ 
PTTVSAKTDI 
HSIALGSSAA 
GENVKPHHQL 
TASLWIPGTA 
PVMIYANVKQ 
ANGRY SLKVH 
FSRVSSGGSF 
EIRMSKSLQN 
RIYVAIRAMD 
WTHHTLSRK 



TTTAACAATG 
GAGATATTTA 
GAAACACATG 
CAGTCTGCTG 
GTACCTGCCA 
ATCATTTGCC 
AAGAAAGAGA 
TATAAGACCC 
CTGTATTAAA 
ACAATTCTTT 
AAATTATTCT 
CAAGGAAAGT 
GTCTGCATTA 
TCTCCTTATC 
ATGTAGCCCC 
ACATCTCCCT 
TGGGTATTAC 
TNTGTAAGTT 
GAATGATAGT 



31 
I 

AGVQLQDNGY 
ATWKANNNSK 
LTAGYGSRGR 
CPQENCIISK 
SLRSAWDVIT 
LQQAAEFYLM 
SICSGLKKGF 
PNLEELSRLT 
KNTVTVDNTV 
KPGHWTYTLN 
GFYPILNATV 
VNHSPSISTP 
SVLGVPAGPH 
IQDDFNNAIL 
RNSLQSAVSN 
KRADKKENGT 



Seq ID NO: 198 DNA Sequence 
Nucleic Acid Accession #: NM_0 01944 
Coding sequence: 84.. 3 083 



1 
I 

TTTTCTTAGA 
TTTCACCAGG 
CCATCTTCGT 
ATGATGAAGA 
AATTTGCCAA 
TTACTTCAGA 
ATCAGCCGCC 
CTATAGTCGA 
AAGGACTAGA 
ATCCTCCAGT 
ACTCACTGGT 
AAATTGCCTT 
GAAACACTGG 
ATCGTCTGGT 
GTAATATTAA 
CAGCACGTAT 
TGGATGAAGA 
GAAATTGGTT 
AGGCTCTAGA 
CTGAATTTCA 
AGGTAATAAA 
AAAAAGGCAT 
ATGAGGACAC 
GATACCTAAT 
ATTCTACTTT 
CGGGTAAAAC 
CAACAGCTGT 
CTAGAACACT 



11 
I 

CATTAACTGC 
GAAATCAGAG 
GGTGGTCATA 
AGAGATGACT 
ACCCTGCAGA 
TTACCAAGCA 
TTTTGGAATC 
CCGGGAGGAA 
TGTAGAGAAA 
ATTTTCACAA 
GATGATACTA 
CAAAATTGTC 
GGAAGTCCGT 
TGTGAGTGGT 
AGTGAAAGAT 
TGAAGAAAAT 
GTACACAGAT 
TGAAATACAA 
TTATGAACAA 
CCAATCAGTT 
TGTAAGAGAA 
AAGTAGCAAA 
TAACAAAGCT 
GATTGATTCA 
CATAGTTAAC 
TTCTACAGGC 
CCTCGAAAAA 
GAATAATAGA 



21 
I 

AGACGGCTGG 
ACAATGATGG 
TTGGTTCATG 
ATGCAACAAG 
GAAGGAGAAG 
ACCCAGAAAA 
TTTGTTGTTG 
ACTCCAAGCT 
CCACTTATAC 
CAAATTTTCA 
AATGCCACAG 
TCTCAGGAAC 
ACTTTGACCA 
GCAGACAAAG 
GTCAACGATA 
ATTTTAAGTT 
AATTGGCTTG 
ACTGATCCTA 
CTACAAAGCG 
ATCTCTCGAT 
GGAATTGCAT 
AAATTGGTGG 
GCCTCAAATG 
AAAACTGCTG 
AAAACAATCA 
ACGGTATATG 
GATGCAGTTT 
TACACTGGCC 



31 
I 

CAGGATAGAA 
GGCTCTTCCC 
GAGAATTGCG 
CTAAAAGAAG 
ATAACTCAAA 
TCACCTACCG 
ACAAAAACAC 
TCCTGATCAC 
TAACGGTTAA 
TGGGTGAAAT 
ATGCAGATGA 
CAGCAGGCAC 
ATTCTCTTGA 
ATGGAGAAGG 
ACTTCCCAAT 
CTGAATTACT 
CAGTATATTT 
GAACTAATGA 
TGAAACTTAG 
ACCGAGTTCA 
TCCGTCCTGC 
ATTATATCCT 
TCAAATATGT 
AAATCAAATT 
CAGCTGAGGT 
TTAGAGTACC 
GCAGTTCTTC 
CCTATACATT 



CTATTTTAGT 
CGTTCTCACC 
AAAGCCACAG 
TATCTAACAT 
GAGATTATCT 
TTATTATAGT 
ATGGAACAAA 
ATGGCCTTCG 
ATGCATTGAG 
TTGGGGGTAG 
TTAAAGTAAT 
TTGTTTTATT 
TAACTGTCTG 
TGTGCAGTAC 
TAATGCAAAG 
GCTAATGCTC 
CTTTGTCTCT 
TCTACTCCCA 
TATAGCCCCN 



41 

I 

NGLLIAINPQ 
IKQESYEKAN 
VFVHEWAHLR 
LFKEGCTFIY 
DSADFHHSPP 
QIVEIHTFVG 
EWEKLNGKA 
GGLKFFVPDI 
GNDTMFLVTW 
NTHHSLQALK 
TATVEPETGD 
AH S IPG SHAM 
PDVFPPCKII 
VNTSKRNPQQ 
IAQAPLFIPP 
KLL 



41 
I 

GCAGCGGCTC 
CAGAACTACA 
AATAGAGACT 
GCAAAAACGT 
AAGAAACCCA 
AATCTCTGGA 
TGGAGATATT 
ATGTCGGGCT 
AATTTTGGAT 
TGAAGAAAAT 
ACCAAACCAC 
ACCCATGTTC 
CCGAGAGCAA 
ACTATCAACT 
GTTTAGAGAC 
TCGATTTCAA 
CTTTACCTCT 
AGGCATCCTG 
TATTGCTGTC 
GTCAACCCCA 
TTCCAAGACA 
GGGAACATAT 
CATGGGACGT 
TGTCAAAAAT 
TCTGGCCATA 
CGATTTCAAT 
ACCTTCCGTG 
TGCACTGGAA 



AAATACATCA 
CCAGATTTCC 
AATTTATGTT 
TGCCCAGGCG 
TATATTGAAA 
TGTGACACAT 
ATTATTATAA 
ACTACAAAAA 
TTTTTGTACA 
ATTAGAAAAC 
GTCTTTAAAG 
GAGGTGGAAA 
TGTGAAGCAA 
AGGTTGCTTG 
CTCTTTACCT 
AGAGATCTTT 
TCATACCGGT 
TCAAAGCAGC 
TATAATGCCT 



51 
I 

VPENQNIjISN 
VIVTDWYGAH 
WGVFDEYNND 
NSTQNATASI 
MNGTELPPPP 
IASFDSKGEI 
YGSVMILVTS 
SNSNSMIDAF 
QASGPPEIIL 
VTVTSRASNS 
PVTLRT.T.DDG 
YVPGYTANGN 
DLEAVKVEEE 
AGIREIFTFS 
NSDPVPARDY 



2580 
2S40 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



51 
1 

ACTTGGACTT 
GGGGCTCTGG 
AAAGGTCAAT 
GAATGGGTGA 
ATTGCCAAGA 
GTGGGAATCG 
AACATAACAG 
CTAAATGCCC 
ATTAATGATA 
AGTGCCTCAA 
TTGAATTCTA 
CTCCTAAGCA 
GCTAGCAGCT 
CAATGTGAAT 
TCTCAGTATT 
GTAACAGATT 
GGGAATGAAG 
AAAGTGGTGA 
AAAAACAAAG 
GTCACAATTC 
TTTACTGTGC 
CAAGCCATCG 
AACGATGGTG 
ATGAACCGAG 
GATGAATACA 
GACAATTGTC 
GTTGTCTCCG 
GATCAACCTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
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TAAAGTTGCC TGCCGTATGG AGTATCACAA CCCTCAATGC TACCTCGGCC CTCCTCAGAG 1740 

CCCAGGAACA GATACCTCCT GGAGTATACC ACATCTCCCT GGTACTTACA GACAGTCAGA 1800 

ACAATCGGTG TGAGATGCCA CGCAGCTTGA CACTGGAAGT CTGTCAGTGT GACAACAGGG 1860 

GCATCTGTGG AACTTCTTAC CCAACCACAA GCCCTGGGAC CAGGTATGGC AGGCCGCACT 1920 

5 CAGGGAGGCT GGGGCCTGCC GCCATCGGCC TGCTGCTCCT TGGTCTCCTG CTGCTGCTGT 1980 

TGGCCCCCCT TCTGCTGTTG ACCTGTGACT GTGGGGCAGG TTCTACTGGG GGAGTGACAG 2040 

GTGGTTTTAT CCCAGTTCCT GATGGCTCAG AAGGAACAAT TCATCAGTGG GGAATTGAAG 2100 

GAGCCCATCC TGAAGACAAG GAAATCACAA ATATTTGTGT GCCTCCTGTA ACAGCCAATG 2160 

GAGCCGATTT CATGGAAAGT TCTGAAGTTT GTACAAATAC GTATGCCAGA GGCACAGCGG 2220 

10 TGGAAGGCAC TTCAGGAATG GAAATGACCA CTAAGCTTGG AGCAGCCACT GAATCTGGAG 2280 

GTGCTGCAGG CTTTGCAACA GGGACAGTGT CAGGAGCTGC TTCAGGATTC GGAGCAGCCA 2340 

CTGGAGTTGG CATCTGTTCC TCAGGGCAGT CTGGAACCAT GAGAACAAGG CATTCCACTG 2400 

GAGGAACCAA TAAGGACTAC GCTGATGGGG CGATAAGCAT GAATTTTCTG GACTCCTACT 2460 

TTTCTCAGAA AGCATTTGCC TGTGCGGAGG AAGACGATGG CCAGGAAGCA AATGACTGCT 2520 

15 TGTTGATCTA TGATAATGAA GGCGCAGATG CCACTGGTTC TCCTGTGGGC TCCGTGGGTT 2580 

GTTGCAGTTT TATTGCTGAT GACCTGGATG ACAGCTTCTT GGACTCACTT GGACCCAAAT 2640 

TTAAAAAACT TGCAGAGATA AGCCTTGGTG TTGATGGTGA AGGCAAAGAA GTTCAGCCAC 2700 

CCTCTAAAGA CAGCGGTTAT GGGATTGAAT CCTGTGGCCA TCCCATAGAA GTCCAGCAGA 2760 

CAGGATTTGT TAAGTGCCAG ACTTTGTCAG GAAGTCAAGG AGCTTCTGCT TTGTCCGCCT 2820 

20 CTGGGTCTGT CCAGCCAGCT GTTTCCATCC CTGACCCTCT GCAGCATGGT AACTATTTAG 2880 

TAACGGAGAC TTACTCGGCT TCTGGTTCCC TCGTGCAACC TTCCACTGCA GGCTTTGATC 2940 

CACTTCTCAC ACAAAATGTG ATAGTGACAG AAAGGGTGAT CTGTCCCATT TCCAGTGTTC 3000 

CTGGCAACCT AGCTGGCCCA ACGCAGCTAC GAGGGTCACA TACTATGCTC TGTACAGAGG 3060 

ATCCTTGCTC CCGTCTAATA TGACCAGAAT GAGCTGGAAT ACCACACTGA CCAAATCTGG 3120 

25 ATCTTTGGAC TAAAGTATTC AAAATAGCAT AGCAAAGCTC ACTGTATTGG GCTAATAATT 3180 

TGGCACTTAT TAGCTTCTCT CATAAACTGA TCACGATTAT AAATTAAATG TTTGGGTTCA 3240 

TACCCCAAAA GCAATATGTT GTCACTCCTA ATTCTCAAGT ACTATTCAAA TTGTAGTAAA 3300 
TCTTAAAGTT TTTCAAAACC CTAAAATCAT ATTCGC 



30 



Seq ID NO: 199 Protein Sequence 
Protein Accession #i NP_001935 



1 11 21 31 41 51 

« 1 I I ' II 

J J MMGLFPRTTG ALAIFVWIL VHGELRIETK GQYDEEEMTM QQAKRRQKRE WVKFAKPCRE 60 

GEDNSKRNPI AKITSDYQAT QK.ITYRISGV GIDQPPFGIF WDKNTGDIN ITAIVDREET 120 

PSFLITCRAL NAQGLDVEKP LILTVKILDI NDNPPVFSQQ IFMGEIEENS ASNSLVMILN 180 

ATDADEPNHL NSKIAFKIVS QEPAGTPMFL LSRNTGEVRT LTNSLDREQA SSYRLWSGA 240 

DKDGEGLSTQ CECNIKVKDV NDNFPMFRDS QYSARIEENI LSSELLRFQV TDLDEEYTDN 300 

40 WLAVYFFTSG NEGNWFEIQT DPRTNEGILK WKALDYEQL QSVKLSIAVK NKAEFHQSVI 360 

SRYRVQSTPV TIQVINVREG IAFRPASKTF TVQKGISSKK LVDYILGTYQ AIDEDTNKAA 420 

SNVKYVMGRN DGGYLMIDSK TAEIKPVKNM NRDSTFIVNK TITAEVLAID EYTGKTSTGT 480 

VYVRVPDFND NCPTAVLEKD AVCSSSPSW VSARTLNNRY TGPYTFALED QPVKLPAVWS 540 

ITTLNATSAL LRAQEQIPPG VYHISLVLTD SQNNRCEMPR SLTLEVCQCD NRGICGTSYP 600 

45 TTSPGTRYGR PHSGRLGPAA IGLLLLGLLL LLLAPLLLLT CDCGAGSTGG VTGGFIPVPD 660 

GSEGTIHQWG IEGAHPEDKE ITNICVPPVT ANGADFMESS EVCTNTYARG TAVEGTSGME 720 

MTTKLGAATE SGGAAGFATG TVSGAASGFG AATGVGICSS GQSGTMRTRH STGGTNKDYA 780 

DGAISMNFLD SYFSQKAFAC AEEDDGQEAN DCLLIYDNEG ADATGSPVGS VGCCSFIADD 840 

LDDSFLDSLG PKFKKLAEIS LGVDGEGKEV QPPSKDSGYG IESCGHPIEV QQTGFVKCQT 900 

50 LSGSQGASAL SASGSVQPAV SIPDPLQHGN YLVTETYSAS GSLVQPSTAG FDPLLTQNVI 960 
VTERVICPIS SVPGNLAGPT QLRGSHTMLC TEDPCSRLI 

Seq ID NOr 200 DNA Sequence 
55 Nucleic Acid Accession 8: NM_020411 
Coding sequence: 86. .526 

1 11 21 31 41 51 

, n I I I I II 

OU GGACCTGGGA AGGAGCATAG GACAGGGCAA GGCGGGATAA GGAGGGGCAC CACAGCCCTT 60 

AAGGCACGAG GGAACCTCAC TGCGCATGCT CCTTTGGTGC CCACCTCAGT GCGCATGTTC 120 

ACTGGGCGTC TTCCCATCGG CCCCTTCGCC AGTGTGGGGA ACGCGGCGGA GCTGTGAGCC 180 

GGCGACTCGG GTCCCTGAGG TCTGGATTCT TTCTCCGCTA CTGAGACACG GCGGACACAC 240 

. _ ACAAACACAG AACCACACAG CCAGTCCCAG GAGCCCAGTA ATGGAGAGCC CCAAAAAGAA 300 

65 GAACCAGCAG CTGAAAGTCG GGATCCTACA CCTGGGCAGC AGACAGAAGA AGATCAGGAT 360 

ACAGCTGAGA TCCCAGTGCG CGACATGGAA GGTGATCTGC AAGAGCTGCA TCAGTCAAAC 420 

ACCGGGGATA AATCTGGATT TGGGTTCCGG CGTCAAGGTG AAGATAATAC CTAAAGAGGA 480 

ACACTGTAAA ATGCCAGAAG CAGGTGAAGA GCAACCACAA GTTTAAATGA AGACAAGCTG 540 

AAACAACGCA AGCTGGTTTT ATATTAGATA TTTGACTTAA ACTATCTCAA TAAAGTTTTG 600 

70 CAGCTTTCAC CAAAAAAAAA AAAAAA 

Seq ID NO: 201 Protein Sequence 
Protein Accession 8: NP_065144.1 

75 1 11 21 31 41 51 

I I I I I I 

MLLWCPPQCA CSLGVFPSAP SPVWGTRRSC EPATRVPEVW ILSPLLRHGG HTQTQNHTAS 60 

PRSPVMESPK KKNQQLKVGI LHLGSRQKKI RIQLRSQCAT WKVICKSCIS QTPGINLDLG 120 
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SGVKVKIIPK EEHCKMPEAG EEQPQV 

Seq ID NO i 202 DNA Sequence 
Nucleic Acid Accession 8: AA172056 
Coding sequence: 121.. 339 

1 11 21 31 41 51 

I I I i I I 

TTTAGCCACC AGAGGANTTC TCTTGAAATA CCCAAAATCC ATCAGTATCT TGAATCATGC 60 

TGGATTTTGA AGAATTCTTA AGAAGCCATG TAAAGGGGGC TCTCTGGCCT TGAAATAGTG 120 

ATGTTTTTTA TACAGAAAGG AGAATGCAGA ATGGTCAGAC TATCATGCAC TGTTAAATTT 180 

GATTTCAAGA AATTACAGGA AAACTTTCCA AAGTTCCATC TCACAGAANN TTATTTTNCC 240 

AAGAATTCCA AGATAAGTTT AGTTTTATGG AAGACTTTTA TGTGGTTTTT ACTCACTCTT 300 

CATCTCAGAC ATCGACAGAT GATTACATCA CTTATAGTTC TAGTAAATTT ATTAATATAA 360 

AACTCAGAGA CATTCCAATA TCCACATTGC TTACACCATT AGGCATAGAT TCAGTGTCAG 420 

CTATGACAAT TGAAAATGAG CTGTTTTGTG ATTTAAAGGT TTAAATTTCT CTAACGAAAC 480 

TGCTTGATCC AGATGCAGGA CTGCAAATGT TAATATTTGT TCTGGAAGAA CAATCAAATA 540 

AGACTTAAGA GGAAAGGGAA TGGCCACAAT CCACCTGAAA TTTTTTCTTA AAAAGTGTGC 600 

AGCCTACTAA ATCAGAATGA AAATAGAAGT ACAAGATTAT AAACAAAATG CAATCAAACT 660 

TTTCTTAAGC TTACCTAAAG TTATTTCATC TGAAAATTTC AAGCAACTTT GTTCAACATT 720 

AAATTGACAA TCTAAACTAA CAAGTCTTTT GAATTTATGC ATGGTAGTAA ACATTCTCTC 780 

TATTAACTTT ATTACCTAAG GCTAAACCTA AAATTTTTAA GCAAAATTAG AAAAATAGTC 840 

TTCACTCATC AAAAAATAAA GTTTGTTACA TTTAGTATTT TCCCAATAAA ATTGGTCGTT 900 

CTTGGTTTTT TATTTGGAGA GTCTGTGCAA AATGTCACTA AAAATAAATT AGCACTAGAA 960 
ATTATTTCTA AATACCAAA 

Seq ID NO: 203 DNA Sequence 

Nucleic Acid Accession fh NM_00S656.1 

Coding sequence: 57,. 1535 

1 11 21 31 41 51 

I I I I I I 

GTCATATTGA ACATTCCAGA TACCTATCAT TACTCGATGC TGTTGATAAC AGCAAGATGG 60 

CTTTGAACTC AGGGTCACCA CCAGCTATTG GACCTTACTA TGAAAACCAT GGATACCAAC 120 

CGGAAAACCC CTATCCCGCA CAGCCCACTG TGGTCCCCAC TGTCTACGAG GTGCATCCGG 180 

CTCAGTACTA CCCGTCCCCC GTGCCCCAGT ACGCCCCGAG GGTCCTGACG CAGGCTTCCA 240 

ACCCCGTCGT CTGCACGCAG CCCAAATCCC CATCCGGGAC AGTGTGCACC TCAAAGACTA 300 

AGAAAGCACT GTGCATCACC TTGACCCTGG GGACCTTCCT CGTGGGAGCT GCGCTGGCCG 360 

CTGGCCTACT CTGGAAGTTC ATGGGCAGCA AGTGCTCCAA CTCTGGGATA GAGTGCGACT 420 

CCTCAGGTAC CTGCATCAAC CCCTCTAACT GGTGTGATGG CGTGTCACAC TGCCCCGGCG 480 

GGGAGGACGA GAATCGGTGT GTTCGCCTCT ACGGACCAAA CTTCATCCTT CAGATGTACT 540 

CATCTCAGAG GAAGTCCTGG CACCCTGTGT GCCAAGACGA CTGGAACGAG AACTACGGGC 600 

GGGCGGCCTG CAGGGACATG GGCTATAAGA ATAATTTTTA CTCTAGCCAA GGAATAGTGG 660 

ATGACAGCGG ATCCACCAGC TTTATGAAAC TGAACACAAG TGCCGGCAAT GTCGATATCT 720 

ATAAAAAACT GTACCACAGT GATGCCTGTT CTTCAAAAGC AGTGGTTTCT TTACGCTGTT 780 

TAGCCTGCGG GGTCAACTTG AACTCAAGCC GCCAGAGCAG GATCGTGGGC GGTGAGAGCG 840 

CGCTCCCGGG GGCCTGGCCC TGGCAGGTCA GCCTGCACGT CCAGAACGTC CACGTGTGCG 900 

GAGGCTCCAT CATCACCCCC GAGTGGATCG TGACAGCCGC CCACTGCGTG GAAAAACCTC 960 

TTAACAATCC ATGGCATTGG ACGGCATTTG OGGGGATTTT GAGACAATCT TTCATGTTCT 1020 

ATGGAGCGGG ATACCAAGTA CAAAAAGTGA TTTCTCATCC AAATTATGAC TCCAAGACCA 1080 

AGAACAATGA CATTGCGCTG ATGAAGCTGC AGAAGCCTCT GACTTTCAAC GACCTAGTGA 1140 

AACCAGTGTG TCTGCCCAAC CCAGGCATGA TGCTGCAGCC AGAACAGCTC TGCTGGATTT 1200 

CCGGGTGGGG GGCCACCGAG GAGAAAGGGA AGACCTCAGA AGTGCTGAAC GCTGCCAAGG 1260 

TGCTTCTCAT TGAGACACAG AGATGCAACA GCAGATATGT CTATGACAAC CTGATCACAC 1320 

CAGCCATGAT CTGTGCCGGC TTCCTGCAGG GGAACGTCGA TTCTTGCCAG GGTGACAGTG 1380 

GAGGGCCTCT GGTCACTTCG AACAACAATA TCTGGTGGCT GATAGGGGAT ACAAGCTGGG 1440 

GTTCTGGCTG TGCCAAAGCT TACAGACCAG GAGTGTACGG GAATGTGATG GTATTCACGG 1500 

ACTGGATTTA TCGACAAATG AAGGCAAACG GCTAATCCAC ATGGTCTTCG TCCTTGACGT 1560 

CGTTTTACAA GAAAACAATG GGGCTGGTTT TGCTTCCCCG TGCATGATTT ACTCTTAGAG 1620 

ATGATTCAGA GGTCACTTCA TTTTTATTAA ACAGTGAACT TGTCTGGCTT TGGCACTCTC 1680 

TGCCATACTG TGCAGGCTGC AGTGGCTCCC CTGCCCAGCC TGCTCTCCCT AACCCCTTGT 1740 

CCGCAAGGGG TGATGGCCGG CTGGTTGTGG GCACTGGCGG TCAATTGTGG AAGGAAGAGG 1800 

GTTGGAGGCT GCCCCCATTG AGATCTTCCT GCTGAGTCCT TTCCAGGGGC CAATTTTGGA 1860 

TGAGCATGGA GCTGTCACTT CTCAGCTGCT GGATGACTTG AGATGAAAAA GGAGAGACAT 1920 

GGAAAGGGAG ACAGCCAGGT GGCACCTGCA GCGGCTGCCC TCTGGGGCCA CTTGGTAGTG 1980 

TCCCCAGCCT ACTTCACAAG GGGATTTTGC TGATGGGTTC TTAGAGCCTT AGCAGCCCTG 2040 

GATGGTGGCC AGAAATAAAG GGACCAGCCC TTCATGGGTG GTGACGTGGT AGTCACTTGT 2100 

AAGGGGAACA GAAACATTTT TGTTCTTATG GGGTGAGAAT ATAGACAGTG CCCTTGGTGC 2160 

GAGGGAAGCA ATTGAAAAGG AACTTGCCCT GAGCACTCCT GGTGCAGGTC TCCACCTGCA 2220 

CATTGGGTGG GGCTCCTGGG AGGGAGACTC AGCCTTCCTC CTCATCCTCC CTGACCCTGC 2280 

TCCTAGCACC CTGGAGAGTG AATGCCCCTT GGTCCCTGGC AGGGCGCCAA GTTTGGCACC 2340 

ATGTCGGCCT CTTCAGGCCT GATAGTCATT GGAAATTGAG GTCCATGGGG GAAATCAAGG 2400 

ATGCTCAGTT TAAGGTACAC TGTTTCCATG TTATGTTTCT ACACATTGAT GGTGGTGACC 2460 
CTGAGTTCAA AGCCATCTT 

Seq ID NO: 204 Protein Sequence 
Protein Accession fl: NP_005647.1 
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WO 03/003906 



1 11 21 31 41 51 

I I I I I I 

MALNSGSPPA IGPYYENHGY QPENPYPAQP TWPTVYEVH PAQYYPSPVP QYAPRVLTQA 60 

SNPWCTQPK SPSGTVCTSK TKKALCITLT LGTFLVGAAL AAGLLWKFMG SKCSNSGIEC 120 

DSSGTCINPS NWCDGVSHCP GGEDENRCVR LYGPNFILQM YSSQRXSWHP VCQDDWNENY 180 

GRAACRDMGY KNNFYSSQGI VDDSGSTSFM KLNTSAGNVD IYKKLYHSDA CSSKAWSLR 240 

CLACGVNLNS SRQSRIVGGE SALPGAWPWQ VSLHVQNVHV CGGSIITPEW IVTAAHCVEK 300 

PLNNPWHWTA FAGIIiRQSFM FYGAGYQVQK VISHPNYDSK TKNNDIALMK LQKPLTFNDL 360 

VKPVCLPNPG MMLQPEQLCW ISGWGATEEK GKTSEVLNAA KVLLIETQRC NSRYVYDNLI 420 

TPAMICAGFL QGNVDSCQGD SGGPLVTSNN NIWWLIGDTS WGSGCAKAYR PGVYGNVMVF 480 
TDWIYRQMKA NG 

Seq ID NO: 205 DNA Sequence 
Nucleic Acid Accession ft: XM_044533 
Coding sequence: 238.. 2751 

1 11 21 31 41 51 

I I I I I I 

GCTCTGCCCA AGCCGAGGCT GCGGGGCCGG CGCCGGCGGG AGGACTGCGG TGCCCCGCGG 60 

AGGGGCTGAG TTTGCCAGGG CCCACTTGAC CCTGTTTCCC ACCTCCCGCC CCCCAGGTCC 120 

GGAGGCGGGG GCCCCCGGGG CGACTCGGGG GCGGACCGCG GGGCGGAGCT GCCGCCCGTG 180 

AGTCCGGCCG AGCCACCTGA GCCCGAGCCG CGGGACACCG TCGCTCCTGC TCTCCGAATG 240 

CTGCGCACCG CGATGGGCCT GAGGAGCTGG CTCGCCGCCC CATGGGGCGC GCTGCCGCCT 300 

CGGCCACCGC TGCTGCTGCT CCTGCTGCTG CTGCTCCTGC TGCAGCCGCC GCCTCCGACC 360 

TGGGCGCTCA GCCCCCGGAT CAGCCTGCCT CTGGGCTCTG AAGAGCGGCC ATTCCTCAGA 420 

TTCGAAGCTG AACACATCTC CAACTACACA GCCCTTCTGC TGAGCAGGGA TGGCAGGACC 480 

CTGTACGTGG GTGCTCGAGA GGCCCTCTTT GCACTCAGTA GCAACCTCAG CTTCCTGCCA 540 

GGCGGGGAGT ACCAGGAGCT GCTTTGGGGT GCAGACGCAG AGAAGAAACA GCAGTGCAGC 600 

TTCAAGGGCA AGGACCCACA GCGCGACTGT CAAAACTACA TCAAGATCCT CCTGCCGCTC 660 

AGCGGCAGTC ACCTGTTCAC CTGTGGCACA GCAGCCTTCA GCCCCATGTG TACCTACATC 720 

AACATGGAGA ACTTCACCCT GGCAAGGGAC GAGAAGGGGA ATGTCCTCCT GGAAGATGGC 780 

AAGGGCCGTT GTCCCTTCGA CCCGAATTTC AAGTCCACTG CCCTGGTGGT TGATGGCGAG 840 

CTCTACACTG GAACAGTCAG CAGCTTCCAA GGGAATGACC CGGCCATCTC GCGGAGCCAA 900 

AGCCTTCGCC CCACCAAGAC CGAGAGCTCC CTCAACTGGC TGCAAGACCC AGCTTTTGTG 960 

GCCTCAGCCT ACATTCCTGA GAGCCTGGGC AGCTTGCAAG GCGATGATGA CAAGATCTAC 1020 

TTTTTCTTCA GCGAGACTGG CCAGGAATTT GAGTTCTTTG AGAACACCAT TGTGTCCCGC 1080 

ATTGCCCGCA TCTGCAAGGG CGATGAGGGT GGAGAGCGGG TGCTACAGCA GCGCTGGACC 1140 

TCCTTCCTCA AGGCCCAGCT GCTGTGCTCA CGGCCCGACG ATGGCTTCCC CTTCAACGTG 1200 

CTGCAGGATG TCTTCACGCT GAGCCCCAGC CCCCAGGACT GGCGTGACAC CCTTTTCTAT 1260 

GGGGTCTTCA CTTCCCAGTG GCACAGGGGA ACTACAGAAG GCTCTGCCGT CTGTGTCTTC 1320 

ACAATGAAGG ATGTGCAGAG AGTCTTCAGC GGCCTCTACA AGGAGGTGAA CCGTGAGACA 1380 

CAGCAGTGGT ACACCGTGAC CCACCCGGTG CCCACACCCC GGCCTGGAGC GTGCATCACC 1440 

AACAGTGCCC GGGAAAGGAA GATCAACTCA TCCCTGCAGC TCCCAGACCG CGTGCTGAAC 1500 

TTCCTCAAGG ACCACTTCCT GATGGACGGG CAGGTCCGAA GCCGCATGCT GCTGCTGCAG IS 60 

CCCCAGGCTC GCTACCAGCG CGTGGCTGTA CACCGCGTCC CTGGCCTGCA CCACACCTAC 1620 

GATGTCCTCT TCCTGGGCAC TGGTGACGGC CGGCTCCACA AGGCAGTGAG CGTGGGCCCC 1680 

CGGGTGCACA TCATTGAGGA GCTGCAGATC TTCTCATCGG GACAGCCCGT GCAGAATCTG 1740 

CTCCTGGACA CCCACAGGGG GCTGCTGTAT GCX3GCCTCAC ACTCGGGCGT AGTCCAGGTG 1800 

CCCATGGCCA ACTGCAGCCT GTACAGGAGC TGTGGGGACT GCCTCCTCGC CCGGGACCCC 1860 

TACTGTGCTT GGAGCGGCTC CAGCTGCAAG CACGTCAGCC TCTACCAGCC TCAGCTGGCC 1920 

ACCAGGCCGT GGATCCAGGA CATCGAGGGA GCCAGCGCCA AGGACCTTTG CAGCGCGTCT 1980 

TCGGTTGTGT CCCCGTCTTT TGTACCAACA GGGGAGAAGC CATGTGAGCA AGTCCAGTTC 2040 

CAGCCCAACA CAGTGAACAC TTTGGCCTGC COGCTCCTCT CCAACCTGGC GACCCGACTC 2100 

TGGCTACGCA ACGGGGCCCC CGTCAATGCC TCGGCCTCCT GCCACGTGCT ACCCACTGGG 2160 

GACCTGCTGC TGGTGGGCAC CCAACAGCTG GGGGAGTTCC AGTGCTGGTC ACTAGAGGAG 2220 

GGCTTCCAGC AGCTGGTAGC CAGCTACTGC CCAGAGGTGG TGGAGGACGG GGTGGCAGAC 2280 

CAAACAGATG AGGGTGGCAG TGTACCCGTC ATTATCAGCA CATCGCGTGT GAGTGCACCA 2340 

GCTGGTGGCA AGGCCAGCTG GGGTGCAGAC AGGTCCTACT GGAAGGAGTT CCTGGTGATG 2400 

TGCACGCTCT TTGTGCTGGC CGTGCTGCTC CCAGTTTTAT TCTTGCTCTA CCGGCACCGG 2460 

AACAGCATGA AAGTCTTCCT GAAGCAGGGG GAATGTGCCA GCGTGCACCC CAAGACCTGC 2520 

CCTGTGGTGC TGCCCCCTGA GACCCGCCCA CTCAACGGCC TAGGGCCCCC TAGCACCCCG 2580 

CTCGATCACC GAGGGTACCA GTCCCTGTCA GACAGCCCCC CGGGGTCCCG AGTCTTCACT 2640 

GAGTCAGAGA AGAGGCCACT CAGCATCCAA GACAGCTTCG TGGAGGTATC CCCAGTGTGC 2700 

CCCCGGCCCC GGGTCCGCCT TGGCTCGGAG ATCCGTGACT CTGTGGTGTG AGAGCTGACT 2760 

TCCAGAGGAC GCTGCCCTGG CTTCAGGGGC TGTGAATGCT CGGAGAGGGT CAACTGGACC 2820 

TCCCCTCCGC TCTGCTCTTC GTGGAACACG ACCGTGGTGC CCGGCCCTTG GGAGCCTTGG 2880 

GGCCAGCTGG CCTGCTGCTC TCCAGTCAAG TAGCGAAGCT CCTACCACCC AGACACCCAA 2 940 

ACAGCCGTGG CCCCAGAGGT CCTGGCCAAA TATGGGGGCC TGCCTAGGTT GGTGGAACAG 3000 

TGCTCCTTAT GTAAACTGAG CCCTTTGTTT AAAAAACAAT TCCAAATGTG AAACTAGAAT 3060 

GAGAGGGAAG AGATAGCATG GCATGCAGCA CACACGGCTG CTCCAGTTCA TGGCCTCCCA 3120 

GGGGTGCTGG GGATGCATCC AAAGTGGTTG TCTGAGACAG AGTTGGAAAC CCTCACCAAC 3180 

TGGCCTCTTC ACCTTCCACA TTATCCCGCT GCCACCGGCT GCCCTGTCTC ACTGCAGATT 3240 

CAGGACCAGC TTGGGCTGCG TGCGTTCTGC CTTGCCAGTC AGCCGAGGAT GTAGTTGTTG 3300 

CTGCOGTCGT CCCACCACCT CAGGGACCAG AGGGCTAGGT TGGCACTGCG GCCCTCACCA 3360 

GGTCCTGGGC TCGGACCCAA CTCCTGGACC TTTCCAGCCT GTATCAGGCT GTGGCCACAC 3420 

GAGAGGACAG CGCGAGCTCA GGAGAGATTT CGTGACAATG TACGCCTTTC CCTCAGAATT 3480 

CAGGGAAGAG ACTGTCGCCT GCCTTCCTCC GTTGTTGCGT GAGAACCCGT GTGCCCCTTC 3540 
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CCACCATATC CACCCTCGCT CCATCTTTGA ACTCAAACAC GAGGAACTAA CTGCACCCTG 3600 

GTCCTCTCCC CAGTCCCCAG TTCACCCTCC ATCCCTCACC TTCCTCCACT CTAAGGGATA 3660 

TCAACACTGC CCAGCACAGG GGCCCTGAAT TTATGTGGTT TTTATACATT TTTTAATAAG 3720 
ATGCACTTTA TGTCATTTTT TAATAAAGTC TGAAGAATTA CTGTTT 

Seq ID NO: 206 Protein Sequence 
Protein Accession #: XP_044533.6 

1 11 21 31 41 51 

I I I I I I 

MLRTAMGLRS WLAAPWGALP PRPPLLLLLL LLLLLQPPPP TWALSPRISL PLGSEERPFL 60 

RFEAEHISNY TALLLSRDGR TLYVGAREAL FALSSNLSFL PGGEYQELLW GADAEKKQQC 120 

SPKGKDPQRD CQNYIKILLP LSGSHLPTCG TAAFSPMCTY INMENFTLAR DEKGNVLLED 180 

GKGRCPFDPN FKSTALWDG ELYTGTVSSF QGNDPAISRS QSLRPTKTES SLNWLQDPAF 240 

VASAYIPESL GSLQGDDDKI YPFPSETGQE FEFFENTIVS RIARICKGDE GGERVLQQRW 300 

TSFLKAQLLC SRPDDGFPFN VLQDVFTLSP SPQDWRDTLF YGVFTSQWHR GTTEGSAVCV 360 

FTMKDVQRVF SGLYKEVNRE TQQWYTVTHP VPTPRPGACI TNSARERKIN SSLQLPDRVL 420 

NFLKDHFLMD GQVRSRMLLL QPQARYQRVA VHRVPGLHHT YDVLFLGTGD GRLHKAVSVG 480 

PRVHIIEELQ IFSSGQPVQN LLLDTHRGLL YAASHSGWQ VPMAMCSLYR SCGDCLLARD 540 

PYCAWSGSSC KHVSLYQPQL ATRPWIQD1E GASAKDLCSA SSWSPSFVP TGEKPCEQVQ 600 

FQPNTVNTLA CPLLSNLATR LHLRNGAPVN ASASCHVLPT GDLLLVGTQQ LGEFQCWSLE 660 

EGFQQLVASY CPEWEDGVA DQTDBGGSVP VIISTSRVSA PAGGKASWGA DRSYWKEFLV 720 

MCTLFVLAV1. LPVLFLLYRH RNSMKVFLKQ GECASVHPKT CPWLPPETR PLNGLGPPST 780 
PLDHRGYQSL SDSPPGSRVF TESEKRPLSI QDSFVEVSPV CPRPRVRLGS EIRDSW 
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It is understood that the examples described above in no way serve to limit the true 
scope of this invention, but rather are presented for illustrative purposes. All publications, 
sequences of accession numbers, and patent applications cited in this specification are herein 
5 incorporated by reference as if each individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. 
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WHAT IS CLAIMED IS: 



1 . A method of detecting a bladder cancer-associated transcript in a cell from a patient, 
5 the method comprising contacting a biological sample from the patient with a polynucleotide 
that selectively hybridizes to a sequence at least 80% identical to a sequence as shown in 
Tables 1A-13. 



2. The method of Claim 1 , wherein the biological sample comprises isolated nucleic 
10 acids. 



3 . The method of Claim 2 : 

a) wherein the nucleic acids are mRNA; or 

b) further comprising the step of amplifying nucleic acids before the step of contacting 
15 the biological sample with the polynucleotide. 



20 



The method of Claim 1, wherein the polynucleotide: 

a) comprises a sequence as shown in Tables 1 A- 13; or 

b) is immobilized on a solid surface. 

The method of Claim 1, wherein the patient is: 

a) undergoing a therapeutic regimen to treat bladder cancer; or 

b) suspected of having bladder cancer. 



25 6. An isolated nucleic acid molecule consisting of a polynucleotide sequence as shown 
in Tables 1A-13. 



7. The nucleic acid molecule of Claim 6, which is labeled. 
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8. An expression vector comprising the nucleic acid of Claim 7. 
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9. A host cell comprising the expression vector of Claim 8. 

10. An isolated polypeptide which is encoded by a nucleic acid molecule having 
polynucleotide sequence as shown in Tables 1 A- 13. 

11. An antibody that specifically binds a polypeptide of Claim 1 0. 

12. The antibody of Claim 11, further conjugated to an effector component. 

13. The antibody of Claim 12, wherein the effector component is a fluorescent label. 

14. The antibody of Claim 12, wherein the effector component is a radioisotope or a 
cytotoxic chemical. 

15. The antibody of Claim 1 1 , which is 

a) an antibody fragment; or 

b) a humanized antibody 

16. A method of detecting a bladder cancer cell in a biological sample from a patient, the 
method comprising contacting the biological sample with an antibody of Claim 1 L 

1 7. The method of Claim 1 6, wherein the antibody is further conjugated to an effector 
component. 

18. The method of Claim 1 7, wherein the effector component is a fluorescent label. 

1 9. A method for identifying a compound that modulates a bladder cancer-associated 
polypeptide, the method comprising the steps of: 
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a) contacting the compound with a bladder cancer-associated polypeptide, the 
polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at 
least 80% identical to a sequence as shown in Tables 1 A-13; and 

b) determining the functional effect of the compound upon the polypeptide. 

5 

20. A drug screening assay comprising the steps of 

a) administering a test compound to a mammal having bladder cancer or a cell isolated 
therefrom; 

b) comparing the level of gene expression of a polynucleotide that selectively hybridizes 
10 to a sequence at least 80% identical to a sequence as shown in Tables 1A-13 in a 

treated cell or mammal with the level of gene expression of the polynucleotide in a 
control cell or mammal, wherein a test compound that modulates the level of 
expression of the polynucleotide is a candidate for the treatment of bladder cancer. 



306 



(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 




(19) World Intellectual Property Organization 

International Bureau 

(43) International Publication Date (10) International Publication Number 

16 January 2003 (16.01.2003) PC T WO 03/003906 A3 



(51) International Patent Classification 7 : C12Q 1/68, 
C07H 21/04, C07K 14/00 

(21) International Application Number: PCT/US02/21338 

(22) International Filing Date: 3 July 2002 (03.07.2002) 

(25) Filing Language: English 

(26) Publication Language: English 



(30) Priority Data: 






60/302,814 


3 July 2001 (03.07.2001) 


US 


60/310,099 


3 August 2001 (03.08.2001) 


US 


60/343,705 


8 November 2001 (08.11.2001) 


us 


60/350,666 


13 November 2001 (13.11.2001) 


us 


60/372,246 


12 April 2002(12.04.2002) 


us 



(71) Applicant: EOS BIOTECHNOLOGY, INC. [US/US]; 
225 A Gateway Boulevard, South San Francisco, CA 94080 
(US). 

(72) Inventors: MACK, David, H.; 2076 Monterey Avenue, 
Menlo Park, CA 94025 (US). AZIZ, Natasha; 411 Cali- 
fornia Avenue, Palo Alto, CA 94306 (US). 

(74) Agents: PARENT, Annette, S. et ah; Townsend and 
Townsend and Crew LLP, Two Embarcadero Center, 
Eighth Floor, San Francisco, CA 94111 (US). 



(81) Designated States (national): AE, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU, 
CZ, DE, DK, DM, DZ, EC, EE, ES, Fl, GB, GD, GE, GH, 
GM, HR, HU, ID, 1L, IN, IS, JP, KE, KG, KP, KR, KZ, LC, 
LK, LR, LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, 
MX, MZ, NO, NZ, OM, PH, PL, PT, RO, RU, SD, SE, SG, 
SI, SK, SL, TJ, TM, TN, TR, TT, TZ, UA, UG, UZ, VN, 
YU, ZA, ZM, ZW. 

(84) Designated States (regional): ARJPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZM, ZW), 
Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
European patent (AT, BE, BG, CH, CY, CZ, DE, DK, EE, 
ES, FI, FR, GB, GR, IE, IT, LU, MC, NL, PT, SE, SK, 
TR), OAPI patent (BF, BJ, CF, CG, CI, CM, GA, GN, GQ, 
GW, ML, MR, NE, SN, TD, TG). 

Published: 

— with international search report 

(88) Date of publication of the international search report: 

6 November 2003 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



3 

o 

On 

o 
o 

f*5 



(54) Title: DIAGNOSTIC AND SCREENING METHODS FOR BLADDER CANCER 

(57) Abstract: Described herein are genes whose expression are up-regulated or down-regulated in bladder cancer. Also described 
are such genes whose expression is further up-regulated or down-regulated in drug-resistant bladder cancer cells. Related methods 
and compositions that can be used for diagnosis, prognosis, or treatment of bladder cancer are disclosed. Also described herein are 
methods that can be used to identify modulators of bladder cancer. 



1 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US02/21338 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC(7) : C12Q 1/68; C07H 21/04; C07K 14/00 

US CL : 536/24.3; 435/6; 514/2; 530/350. 300 
According to International Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



M inimum documentation searched (classification system followed by classification symbols) 
U.S. : 536/24.3; 435/6; 514/2; 530/350, 300 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practicable, search terms used) 
Please See Continuation Sheet 



DOCUMENTS CONSIDERED TO BE RELEVANT 



Category * 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



Y.B 
Y 
A 



US 2003/0050470 Al (AN et al) 13 March 2003 (13.03.03). column page 6, par. 57-62; 
page page 13, par. 135-141; page 15, par. 158-163; page 30, par. 349-351. 
US 5,866,535 A (GBTZENBERG et al) 02 February 1999 (02.02.1999), column col. 5, 
lines 43-55; col. 10, lines 31-60; col. U, lines 30-67. 

SRINIVAS, P.R. Proteomics in Early Detection of Cancer. Clinical Chemistry. 22 June 
2001, Vol. 47, No. 10, pages 1901 -1911, especially pages 1906-1908. 



1-20 
1-5, 7-20 
1-20 



□ 



Further documents are listed in the continuation of Box C 



• □ 



See patent family annex. 



Special categories of c 



-A" 

"E* 
"L" 



document defining the general state of the an which is not considered to be 
of particular relevance 

earlier application or patent published on or after the international filing date 

document which may throw doubts on priority claim(s) or which is cited to 
establish the publication date of another citation or other special reason (as 
specified) 

document referring to an oral disclosure, use, exhibition or other means 

document published prior to the international filing date but later than the 
priority date claimed 



later document published after the international fili ng date or priority 
date and not in conflict with the application but cited to understand the 
principle or theory underlying the invention 

document of paiticular relevance; the claimed Invention cannot be 
considered novel or cannot be considered to involve an inventive step 
when the document is taken alone 



document of particular relevance; the claimed invention 
considered to Involve an inventive step when the 
combined with one or more other such documents, 
being obvious to a person skilled in the art 



document member of the s 



5 patent family 



Date of the actual completion of the international search 
07 April 2003 (07.04.2003) 



Date of mailing of the international search report 




5i ine international search 

2 3 MAY ZOOT 



Name and mailing address of the ISA/US 

Commissioner of Patents and Trademarks 
BoxPCT 

Washington, D.C. 20231 

Facsimile No. (703)305-3230 



[morizgd 
GaryBenzion 
Telephone No. 703-308-1235 



Form PCT/ISA/210 (second sheet) (July 1998) 



